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ABSTRACT 

Institutions are staffed and are created to do the job of regulating organizations. 
This staffing, and all the creative work that is involved in financing, governing, 
training, and motivating institutional actions by that staff in organizations, has 
been lost in recent institutional theorizing. This staffing was central to the old 
institutionalism, which is why it looked so different. The argument is exemplified 
by applying the insights of classical institutionalists to the legitimacy of court 
decisions as determined by the law of evidence, to the legitimacy of competition 
and the destruction of other organizations by competition, to the noncontractual 
basis of contract in commitments to maintain competence to do the performances 
required in contracts, and to the failure of institutions of capitalist competition and 
the substitution of mafia-like enforcement of contracts in postcommunist Russia. 
The institutions of the new institutionalism do not have enough causal substance 
and enough variance of characteristics to explain such various phenomena. 


Introduction 


I was recruited into sociology by the institutional economists, especially by 
Thorstein Veblen; sociologists at Central Michigan College of Education read 
Veblen, while economists there did not. I started with The Theory of the 
Leisure Class (1934 [1899}), and then whatever of the Veblen corpus was 
in the CMCE library. I went on before graduate school to Karl Marx, Joseph 
Alois Schumpeter, and Max Weber, then in graduate school to John Maurice 
Clark, Clark Kerr, John R Commons, and many historians of law (because most 
economic institutions eventually make their way into the courts). I became a 
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sociologist in order to make a living as an institutional economist, since 1t was 
clear that I would make a poor living at that among economists. 

My first intellectual identity, then, was shaped by the old institutionalists. 
They left behind a legacy of problems, more problems perhaps than solutions. 
The generative aspect of the old institutionalism, its openness to the macrosoci- 
ological background structure behind firms, household consumption, and mar- 
kets, gave me things to work on. 

This made Berkeley the place to go for graduate school, because TVA and 
the Grass Roots (Selznick 1949) was about one way to institutionalize elec- 
tricity and fertilizer production, as well as public versus private land tenure; 
Union Democracy (Lipset et al 1956) was about a form of appropriation of skill 
monopolies; Work and Authority in Industry (Bendix 1956) about one way to 
develop institutions of small-firm manufacturing of textiles. Selznick, Lipset, 
and Bendix were, whatever else they also were, institutional economists. 

Institutions, I learned then, shaped the creation and functions of units in 
market and the relations between them. But unlike the institutions of mod- 
ern institutionalism, people ran these institutions by organizing activities on 
their behalf. Institutions were, in the first instance, created by purposive people 
in legislatures and international unions, and in pamphlets of business ideol- 
ogists in Northern England. Modern institutionalism, to create a caricature, 
is Durkheimian in the sense that collective representations manufacture them- 
selves by opaque processes, are implemented by diffusion, are exterior and 
constraining without exterior people doing the creation or the constraining. 

The purpose of this essay then is to take some selected problems of modern 
economic sociology, and some theoretical generative principles of the old insti- 
tutionalism, and to show there is life in the old bones yet. In the background of 
the essay is a contrast between the old institutionalism in which people built and 
ran institutions, and the new Durkheimian institutionalism in which collective 
representations operate on their own. My primary purpose is generative rather 
than critical, so the picture I draw of the new institutionalism is contentious. But 
I hope that caricature is a stylized version of the central thrust of the newness 
of the new institutionalism, and particularly of the newness of its faults. 

This is then a review by contrast of new institutionalist economic sociology. 
Its main purpose is to revive some central mechanisms from the old institution- 
alism, and to use that revivification to throw light on the broad contours of our 
present difficulties. 

In the first part of the analysis, I treat particularly John Henry Wigmore, who 
studied the relation between the institutions of evidence law and the functioning 
of American and British courts as organizations. It is a peculiarity of such 
courts that they produce legitimate decisions, and so legitimacy is absolutely 
central to their survival, and that in order to produce legitimate decisions they 
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have to use legitimate means, especially legitimate evidence. Producing such 
legitimate evidence is the product of adversary court organizations that involve 
judges, contending counsel, juries, and a flow of evidence from the outside from 
witnesses and documents, actions with legal consequences such as delivery of 
signed documents, arrests on probable cause, and the like, all of which have to 
be organized and legitimated (or delegitimated) by the court. 

Because legitimacy is the intended product of these institutions, operating 
through court organizations, the whole enterprise is central to institutional the- 
ory. Much of the legitimacy of courts, as analyzed by Wigmore, has to do with 
effective means to achieve justice by intelligent processing of the substance 
of evidence. Compared to Wigmore’s, modern institutionalism has an impov- 
erished notion of legitimacy and of how the legitimacy of rituals depends on 
substantive good sense applied to questions of value.! 

The second and third parts of this chapter treat the moral commitment aspect 
of contracts for transactions as central to the possibility of the existence of orga- 
nizations in the market. The central old institutionalists treated are RH Coase, 
especially his “The Nature of the Firm" (1937), John R Commons, especially 
his Legal Foundations of Capitalism (1974 [1924], Joseph A Schumpeter, es- 
pecially his Business Cycles (Schumpeter 1964 [1939], pp. 46-83, 105-50; 
also see a more popular version of the argument in 1942), and Philip Selznick, 
especially in his Leadership in Administration (1957). The basic intuition here 
is Durkheim’s observation that there is a noncontractual basis of contract. 

In particular, the set of contracts that constitute a firm has a particular kind of 
noncontractual basis, outlined with great depth and perception by Commons. 
The function of that special basis for contracts that create firms is best outlined 
by Coase’s classic paper on the nature of firm boundaries as activity sets within 


'Wigmore was a lawyers’ law historian He tried to reformulate the history of the cases and 
statutes of the law of evidence so that it would make sense to judges and lawyers deciding evidence 
questions That is, his institutional purpose was to influence courts (including contending counsel) 
to decide evidence questions so as to achieve justice, on average Wigmore therefore tended to 
make more sense of history than probably was originally there and to ignore cases that did not 
make sense from his point of view He also was not an ethnographer trying to reflect accurately the 
informal aspects of the use of evidence law in practice; his purpose was to be authoritative rather 
than ethnographic. Wigmore believed in institutions and in scholarship as a way to improve them. 
My own view ts that he succeeded, but his purpose 1s different from my purpose here of showing 
that he was right about how the courts worked as institutions serving justice. Because I do not know 
tbe cases and history as well as he did, I am not 1n a poston to criticize his analysis from within 
(Twining 1990 has some relevant matenal). Sociologists think of the law of evidence mainly in 
connection with constitutional cases on criminal law, especially for capital crimes against persons 
Wigmore was well aware that most law practice is law on economic claims denving from contract 
and property and that it has more to do with the contract clause in the Constitution than with the 
Bill of Rights, and most crime is against property So for my purposes here, he 1s analyzing an 
economically relevant institution, court procedure on evidence questions 
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which other elements of the firm have a competitive advantage over outside 
firms providing the same services or goods in the market, because of lesser 
“transaction costs.” Schumpeter is the classic source on the relation to the larger 
competitive structure of capitalism of that monopoly advantage that comes from 
a firm’s having superior productivity. All three writers are interested in the form 
of institutions that makes the competitive structure of capitalism possible, given 
what that structure is, namely the competition of firms that can do something 
better than anyone else. 

The impoverished view of modern institutional theorists (especially in the 
“organizational ecology branch”) reduces the conception of competition to that 
of the relations among organizations’ that have the “legitimate” organizational 
form for that “population.” This conception leaves out many aspects of what 
the traditional institutional theorists actually thought about competition. The 
transaction costs literature (e.g. Williamson 1975) preserves more of this con- 
tent, except that it does not study legitimacy of the market itself, and so it has 
a vacuous description of what firms (as hierarchies) are contrasted to. The 
conceptions of Commons and Schumpeter of how competitive markets came 
to be legitimate are not vacuous. 

The second part of this chapter concentrates on how contracts in the market 
come to be legitimate by the way they are constituted, by the mutual belief of the 
contracting parties that each is committed to the line of action promised in the 
contracts. We emphasize especially the commitment of a firm to be competent 
in the future to carry out the activities specified in the contracts. Contracts are 
what markets centrally consist of, so their legitimacy is central to the legitimacy 
of markets—a fact well known to institutional economists in the past. Coase 
then analyzes what it is about the firm that is the noncontractual basis of con- 
tracts, namely its special competencies. Commons analyzes these as they are 
embedded in working rules, while Schumpeter treats them as innovations that 
have specific kinds of effects on the capital market and therefore on business 
cycles. Selznick treats the processes by which commitment to competencies 
are built into organizations. 

The third section treats the problem of the legitimacy in the law of market 
competition as a system, and its relation to the legitimacy of being able to do 
things better than competitors as a moral and legal claim on the profits of such 
competence. 

The fourth section is an analysis of a classic formulation of the failure of in- 
stitutions by Edward Banfield, especially in his The Moral Basis of a Backward 
Society (1967 [1958]). Banfield’s basic argument is that institutions, and com- 
mitment to institutions, are essential to the creation of public goods. In turn, 


2The firms are assumed to be identical, each firm counting as one, in most applications. 
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economic progress involved the production of public goods, including obvious 
ones such as roads or civil order, and not so obvious ones such as the willingness 
to discuss what we should do next in a spirit of honesty and compromise. When 
the institutional means to create public goods are not available, welfare seems 
to people to depend on looking out for themselves and their immediate kin, 
rather than on trying to create greater welfare for all. Banfield applied this anal- 
ysis to Southern Italy. Diego Gambetta (1993) has a compatible analysis about 
why the Mafia as a business supplying guarantees for contracts in a distrustful 
environment does not constitute a good basis for economic development. 

I apply this line of argument to explain why capitalism in Russia is not 
as successful as it has been in the West—perhaps not even as successful as 
socialism was—because the socialist institutions that enable people to create 
public goods have been destroyed but not replaced by institutions appropriate to 
capitalism. In particular, I argue that, under socialism, contracts between firms 
and the corporatist industrial authorities in central planning agencies were used 
to create many of the requisite public goods. The destruction of the capacity 
of those authorities to reward firms for public-oriented behavior has left the 
enforcement of contracts to Mafia-like organizations, as in Southern Italy when 
Banfield was there. 

Older institutionalists did not assume that institutions were always there and 
always worked. Consequently the causes of variation in the effectiveness of 
institutions were part of Banfield's research program. The creation of public 
goods is particularly problematic in a war of each against all. Institutions can 
(but need not) reduce the impediments of the free rider problem, that people 
can benefit from public goods without contributing to their creation. The level 
of success in creating roads, or law and order, or bargaining in good faith are all 
sensitive indicators of institutional effectiveness. The “amoral familism" that 
Banfield found among Southern Italian villagers has its analog in the "amoral 
firmism" of postsocialist Russia, and in the mafias, a good deal like the southern 
Italian Mafia, that enforce contracts. 

These are four rather different arguments. The first studies a clear case in 
which legitimacy of rituals in courtrooms varies with how well they substan- 
tively achieve justice in the use of evidence in the law and so produce legitimate 
decisions. My argument is that unless the rules of evidence are guided by con- 
siderations of justice, they do not produce legitimacy. The second studies why 
firms are willing to enter into contracts that require the counterparty to maintain 
into the future the competence to do the services contracted for. The argument 
is that only 1f firms are committed in several ways, including morally, will that 
promise to maintain competence be believed. The third outlines why it ever 
becomes legitimate for competitors to do damage to each other, and how civil 
law has had to be arranged so that that competitive conflict gives rise not to 
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claims in court, but instead to “legitimate” competition. Schumpeter makes 
strong arguments that economic development depends on innovators, firms that 
do not imitate other firms but are nevertheless legitimately able to do damage to 
entrenched business interests. And he and Commons both argue that this legit- 
imacy of competition has always been precarious, and that only a few kinds of 
institutions can successfully institute it. Finally we study institutions that some- 
times do not work, and Hobbes’s outcome of the war of each against all reigns. 
Banfield argues that when institutions do not work to create public goods, the 
incentives to morality and cooperation that otherwise flow from successful cre- 
ation of public goods (e.g. higher income from economic development) fail to 
operate. Then only mafias, with the kind of consent they elicit, create even a 
primitive social order, a noncontractual game theoretical basis of contract that 
does not work as well as business ethics and effective civil law. 

The overall argument of these four subarguments is that the blank places 
in writings of modern institutionalism come especially from lack of detailed 
theory and research about how particular institutions work. Much of the nar- 
rowness in modern institutionalism in organizational theory is explained by 
lack of detail in the conceptions of institutions. A narrow conception is eas- 
ier to mathematize. This in turn is due to ignoring the work of people who 
put the detail into institutions and who constrain people and organizations to 
conform to institution's exteriority. But if the guts of the causal process of insti- 
tutional influence are left out of the model, then we successfully mathematize 
abstract empiricism, an empiricism without the complexity of real life. Wig- 
more, Commons, Schumpeter, and Banfield knew a lot about variance in how, 
and how well, institutions work; it behooves us to imitate them. 


Ritual Means of Survival vs. Institutionalized Values 


It is the duty of United States appellate courts to behave exactly as specified in 
institutional theory, namely to take as their principle of decision and of writing 
opinions the institutionalized standards specified by the Supreme Court. These 
and other institutionalized standards are embedded ın the legitimate decisions 
at common law and statute law, defined as those that other courts, including 
the Supreme Court, would recognize as valid. Thus according to the new 
institutionalist principles announced in Meyer & Rowan (1977), the court orga- 
nizations with their contending counsel, judges, juries, and flows of witnesses 
are engaging in the formalities of evidence law as a ritual to legitimate what 
they are doing. If there were no deeper principles behind this activity, there 
would be no sense in their writing opinions about why they decided the way 
they did, no reason for either of the contending parties to appeal, and so no 
reason to pay for appellate courts at all; it would be sufficient for them to say 
that the Supreme Court would probably favor the plaintiff (or the defense). 
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Thus it is only because we imagine that justice will be served that we set 
up the whole apparatus of appeals—of judges guessing what each other will 
say—to demand that they obey the legitimizing formal procedures that Meyer 
& Rowan specify and Wigmore on Evidence (1983 [1940]) describes. In short, 
the acceptance of the idea that the appellate court was a ritual would destroy 
its legitimacy. And it is clear that the written opinions of appellate courts are 
central to the allegation that it is not just a ritual, which could as well have 
been dispensed with by allowing the court of original jurisdiction to specify 
what was legitimate under the Constitution and to make its own guess about 
the Supreme Court; and the Supreme Court could in its turn merely send back 
a yes or no, without majority and minority and concurring opinions. 

Similarly in scientific papers, the formal rituals of citation of scientists other 
than the author show that the author is oriented to contributing to a body of 
knowledge recognized by others; there is no obvious ritual reason for the results 
section that actually makes that contribution. We could enormously simplify the 
formal organization of journals if we did not have to have referee comment on the 
results section. There is the same trouble ın explaining the actual algebra in an 
algebra textbook adopted by the California or Texas state textbook certification 
organizations, the actual fertilizer that backed up the coopted farmers on the 
Tennessee Valley Authority boards (Selznick 1949), the actual "cleaving only 
unto" that is the putative organizational consequence of the wedding vows, the 
actual dollars that back up the annual report of a corporation. 

What we have to explain, then, 1s why it 1s so essential to organizations that 
their formal rituals not seem to be a shuck, and why they lose legitimacy as soon 
as it appears they are only interested in the letter of the law, the words in the 
vows, the places on the TVA boards, the right number of pages in the algebra 
books. Why did I, for example, get indignant when my son in the preschool 
learned the correct answer that it is hotter in summer because then the Northern 
Hemisphere is closer to the sun, and he favorably impressed the educational 
authority but not his father by "getting it right"? I was dissatisfied with the 
preschool book because its answer was substantively wrong, not because it was 
not officially authorized. 

We do find formal ritual criteria of whether an appellate opinion is serious 
organizational substance; for example, what court an opinion comes from mat- 
ters for its ritual status. But why is it that intermediate appellate courts take 
producing opinions with those ritual marks so seriously, as opposed to saying 
they guess the Supreme Court might decide yes (or no) and leaving it at that? 
Why would we be dismayed if an appellate court opinion had all the citations 
of cases but missed the point—more dismayed than if it missed citing a crucial 
case and so had to invent the principle of justice over again? Why would we be 
less dismayed if a school taught analytic geometry before Buclidean geometry, 
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outside the canonical order, than if students never learned to treat geometrical 
problems with algebra? 

In short, why is formality never enough for legitimacy, without some convic- 
tion that the formality is just a more abstract form of the substance? Why is it, 
for example, that a principle of legitimacy of a contract is “(d) The court shall 
determine whether the writing was intended by the parties as a final expression 
of their agreement with respect to such terms as are included therein” [California 
Code of Civil Procedure (as amended in 1978), as quoted in Wigmore (1983 
[1940]) p. 75n]? The institutional rule here relies on a substantive determi- 
nation about the wills of the parties, rather than just a formal reading of the 
content of the writing. 

I believe we have recently underestimated the degree to which people accept 
institutions because they think the institutions have the right answer, because 
institutions embody a value that the people also accept. When Selznick (1949) 
showed that a major consequence of “grass roots” administration in the TVA 
was the cooptation of leading potential farmer opponents of TVA into the de- 
fense of government power generation in the Valley, we all thought this made 
TVA’s claim to be democratic more suspect. Formal democracy was not good 
enough for us. Why would we expect that it would be good enough for other 
students of public administration or for citizens? We, too, did not want the TVA 
to go through the democratic motions, because we hold democracy as a value, 
not as a legitimator. Its legitimating capacity follows from its being a value. 

We could investigate this by looking at which social movements succeed 
against formally institutionalized practices. For example, Jeremy Bentham un- 
dertook to argue that the formality of the rules of evidence in his time un- 
dermined the use of reason and good sense in legal inference (Iwining 1990, 
pp. 38-41). The old institutionalist theory of Selznick and others 1s that reason 
and good sense are values, and formality a means to reason and good sense. 
Selznick’s theory would predict Bentham’s success. The view that justice mat- 
ters to the legitimacy of the formality would predict that the law of evidence 
after Bentham would come to put a good deal more faith in the discretion of the 
judge, a good deal less faith in formal versions of the precedent in evidence law. 

And that is what the modern textbooks on the codified statutory law of evi- 
dence say (Rothstein 1981, pp. 10-12). In part the new institutionalism would 
predict that Bentham should have lost out, particularly among lawyers who are 
dependent on predicting what the other lawyers that run the courts will decide. 
The profession of law depends on isomorphism, that the arguments they make 
will be the same as the arguments the courts will accept. It seems rather that 
lawyers wanted instead for judges to use reason and good sense, and not be 
bound by institutionalized rules except those that specified when and on what 
grounds the judge could intervene to use his/her good judgment. That is, in the 
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long pull, lawyers favored rules that used the reason and good sense of judges, 
rather than formal rituals of correctness. 

For example, one observes in the law of evidence that the criteria of evidence 
are applied much more strictly when a hearing deals with substantive rights than 
with the formalities of the course of an issue through the court system, so that 
hearings on a change of venue have very informal rules of evidence. Further, 
the law is more painstaking when the rights at stake are more serious, as in 
criminal cases compared to civil cases (e.g. exclusions of evidence because it 
was illegally obtained are much more common in criminal cases), in criminal 
trials as opposed to grand jury indictment hearings (e.g. the grand jury rather 
than the defendant decides what evidence the defendant can offer), in habeus 
corpus hearings than bail bond hearings, in cases of child custody rather than 
child support levels, in search and seizure of evidence rather than discovery 
hearings [Wigmore on Evidence (1983 [1940]) vol. 1A, pp. 25-332 gives a 
summary of variations in the law of evidence according to jurisdictions].? The 
more justice matters, in short, the greater the formality of the application of the 
law of evidence. 

This point is crucial to our differences with the new institutionalism. We want 
to predict when the institution will demand more formality, not when the orga- 
nization will more enthusiastically adopt the institution’s standards, as Meyer 
& Rowan (1977) do. It is precisely because the behavior of institutional author- 
ities in enforcing standards varies that it is important to notice that institutions 
are staffed, rather than being merely collective representations. 

Similarly, votes in academic departments are counted much more carefully, 
and the votes are more likely to be by secret ballot and kept confidential (the 
doctrine being that one gets honest opinions only with immunity of the voter), 
when the issue is a crucial matter such as a tenure case. Another academic 
example: Graduate students are sent ritually out of the room when their dis- 
sertation defense is being debated and voted on, but not in a seminar where 
their contribution to discussion is to be debated. The criteria and procedure for 
examinations and their grading are often discussed in the syllabus and in fac- 
ulty handbooks, whereas when the professor will actually appear for his office 
hours, as opposed to what is announced, is more informally discussed by angry 
students outside his or her door. In short, the more a judgment of academic 


?When substantive rights are dealt with in administrative hearings (e g. hearings on welfare 
rights) or by agreed arbitration, more informal laws of evidence apply Here we would predict 
that the more important the substantive right, the more formal the rules of evidence compared to 
other administrative hearings or arbitration proceedings Similarly Jerome Skolnick argues (1975 
[1966], pp 155—161) that the bigger the case, the more careful police are to follow the dictates of 
the law of evidence as well as other determinants of the quality of evidence (e.g. policemen make 
better witnesses than drug dealers), because they know evidence will be evaluated more strictly 
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merit matters, the more formalized and institutionalized we find it to be. It is 
hard to imagine that this is all a matter of impression management, that no one 
really cares whether students and junior faculty are evaluated by what they have 
accomplished, rather than by formally correct procedures. But in particular, 
institutional authorities demand more conformity with institutional values the 
more important the decision is. 

My argument then is that variations in the bindingness of institutional rituals 
are to be explained by beliefs about what the institution is for. The more impor- 
tant the issues at stake, the more important will be those rituals that are carefully 
justified as serving the value. As a criminal case passes from questioning on 
the street, to arrest, to indictment, to conviction, the degree of bindingness of 
the rules that make evidence legitimate increases. As the career consequences 
of an academic judgment increase in importance, the greater the formality of 
the specification of what is to be judged, the more likely committee rather than 
individual judgment, the larger the number of stages, and the more the rights 
of appeal. As a negotiation of a contract proceeds, so increases the degree of 
formality of judgment about what has been agreed to, whether the agreement 
is final, and whether the terms have been made clear either during the process 
of negotiation or by reference to the practice in a trade. 

Formality and ritualization, then, increase with the substantive importance 
of the issue, because in general the reason for having things institutionalized 
and ritualized is that they matter. Or to put it the other way around, when the 
value system informing an institution ranks something as of high priority, it is 
more likely that the keepers of the institution will formalize conformity with 
the institution in a ritual designed to monitor, enforce, and enact the value of 
that something. The higher the priority, the higher the formality and ritual. 
The more ritualized a thing is in an institution, the less it is merely a ritual, 
because the more substantively important it is. Or to return to our example of 
evidence law, the more justice depends on a bit of evidence, the more formal 
evidence law there is about the introduction of that evidence. 


Transaction Costs vs Moral Commitment 


The contracts that constitute a market are ordinarily agreements about future 
performances. It is that fact that makes the noncontractual part of contracts 
central, since the parties’ predictions of each other’s future behavior depend on 
predictions about each other’s future morality. But in particular, one routinely 
contracts for another firm’s services when that firm can do something better 
than one can do it oneself. Consequently, it is in particular the sort of morality 
that will guarantee the other firm’s future competence that is central to the 
creation of most markets. If in the future the other firm is not going to be more 
competent at their business than you are, why make a contract with it? 
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To relate the old institutionalism to the institutionalism of transaction cost 
analysis, it is best to start with old transaction cost analysis, in Coase (1937). 
Coase defined the maximizing boundary of a firm as the division between the 
set of things it could do better or cheaper than it could hire or buy. An airplane 
developer and manufacturer can make planes better than an airline can. But 
it can only borrow the money to develop a passenger plane if airlines sign 
contracts in advance to buy (Newhouse 1982). Airlines will buy them for the 
future only if they believe manufacturers will seriously and competently use the 
money to develop and make a better airplane. Airlines will make commitments 
only if the airlines in their turn believe they themselves will still be able to 
compete in selling traveling and express shipping services. 

In particular, airlines will have to sell airline services better than Boeing, 
Lockheed, or McDonnell-Douglas, for if the manufacturers were better at run- 
ning airlines as well as at building airplanes, the airlines would merely be 
financing their competition. Bankers only believe the futures contracts between 
airlines and airframe manufacturers if they think the manufacturer can and will 
build a better airplane, and the airlines will still be there to buy it when the 
contract comes due. It is an immediate derivation of Coase's theory that part, 
at least, of the noncontractual element of contract has to be mutual belief in 
each other's future superior competence. Much of one's transaction cost has 
to do with trying to guarantee by contractual means the future competence of 
the counterparties, without giving over to them the rent of one's own superior 
future competence in one's own business. The nature of this system of mutual 
belief in each other's competence becomes particularly clear when a great deal 
of money, now and in the future, has to pass between three different kinds of 
firms, each relying on the success of the relation between the other two. 

The expected return of all the transaction costs of these development, manu- 
facturing, and loan contracts ultimately depends on the faith of all three parties 
that the manufacturer will show devotion to building better, cheaper, and safer 
airplanes. Unless there is competence and will behind that transaction-cost 
boundary, unless Boeing or Lockheed or McDonnell-Douglas can really do 
some things better than they (or anyone else) can buy on the market, the whole 
web of contracts comes crashing down. Likewise unless the airlines behind 
their firm's equilibrium boundary can do their job, there will be no money to 
pay for the better plane, so in turn no money for the manufacturer to pay the 
banker. The same, changing the changeable parts, holds for the banker pricing 
risks responsibly for the capital market, and using banking equity to back other 
contracts that split up the risks for appropriate parts of the capital market. 

Transaction costs are worth paying, then, only when there are two (or in this 
case at least three) firms believably trying to be and do what it takes to be best 
at their own business. Only if a firm can be believed will any other firm prefer 
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it to internal activities, thus creating the boundary between the two firms and 
the necessity of a contract. 

Similarly, a university contracts out its janitoring and waste removal tasks, 
but keeps its research and teaching in-house, because it has a believable com- 
mitment to scholarship, and no noticeable commitment to clean buildings. It 
is the university’s commitment to research and teaching that ultimately makes 
the waste management firm believe it will be paid for its contract, for the uni- 
versity’s overhead costs will be paid only if students and grant-givers believe 
the university can and will do its job. Similarly the hospital buys a CATSCAN 
machine rather than build one itself, with the income or credit derived from 
its credibility to a client who thinks he may have had a stroke. If the client 
chooses the wrong hospital, that does not know enough to buy a CATSCAN 
machine, he might not get a second choice. And the CATSCAN manufacturer 
that sells to a hospital that cannot attract stroke patients will not be paid for the 
machine. 

The “hierarchy” (Williamson 1975) that lies outside the market, protected 
by transaction costs barriers, has to be a thing that produces reliably on into 
the future, so that its predicted future competence justifies paying transaction 
costs now. The transaction cost (a cost of risk) of going into the emergency 
room of a hospital when one is perhaps having a stroke will only be paid if one 
thinks the medical workers care enough to be competent and to stay competent, 
to work even through their daughter’s seventh birthday party if necessary, to 
triage mainly on the degree of urgency, to buy a CATSCAN machine before 
one has even had the stroke, and so on. 

Of course, the degree to which organizations create their distinctive compe- 
tencies by moral commitments is a variable. And the exact definition of what 
we would mean by that variable is difficult—presumably managerial commit- 
ment is more important than the commitment of hewers of wood and drawers 
of water, for example. But I imagine that we will define that variable more 
exactly by extracting indicators of it from Selznick’s Leadership in Adminis- 
tration (1957) than by abstracting them from Oliver Williamson (1975), because 
Selznick was primarily developing a theory of the “distinctive competence” of 
an organization, its ability to realize values in a way no other organization could. 


Birth and Death vs Institutions of Creative Destruction 


Schumpeter thought that many economic institutions that facilitated capital- 
ism as we know 1t depended on values other than capitalism. Imperialism and 
mercantilism were driven by the value of power as much as by profit. La- 
bor discipline could probably be achieved more easily by socialism than by 
a capitalism now shorn of its feudal governing classes. Values of democracy 
empirically associated with capitalism depended in their turn on competing 
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political elites who wanted to govern first, profit from government afterwards. 
Schumpeter perhaps fits uncomfortably with Selznick as a comember of an old 
institutional school. But one of the positions he holds in common with the old 
institutionalists is that the form of competition among organizations is histor- 
ically variable, depending a good deal on the values of the governing classes 
and their challengers. 

In Schumpeter’s argument the ruling institutions affected organizational ecol- 
ogy, and in particular what he called "creative destruction" (Schumpeter (1964 
[1939]), 1942). In the long run, the big competitors of an earlier population of 
organizations are new; later, populations of organizations of a different sort that 
reshape the niches on which earlier organizational populations depend. If it is 
true that the oxygen atmosphere of earth was created by the first chlorophyll- 
using anaerobic populations, causing the long-run shift to oxygen-breathing 
animals and “creatively destroying" most anaerobic strains, then biological 
evolution has powerful analogies to Schumpeter's reasoning. Thus to ana- 
lyze the within-species or within-"organizational population" competition is 
to miss the big story of evolution and of economic development: interspecies 
competition. 

Thus Schumpeter was primarily interested in the institutions that allowed the 
peaceful* destruction of whole populations of organizations. To put it another 
way, he wanted to know how modern society created such precarious legitimazy 
for a given type of organization that the jobs, profits, or community-sustaining 
contributions of whole industries could be destroyed. If someone else built a 
better mousetrap, all the accumulated legitimacy that accounted for the first 
part of the climb of the old population of cat breeders up the Gompertz curve 
was not apparently enough. Schumpeter regarded this precariousness of the 
legitimacy of old populations of organizations as remarkable, and he therefore 
regarded the institutions that allowed such destruction of people's livelihood as 
precarious. He argued that it was such creative destruction that caused depres- 
sions (Schumpeter 1964 [1939]), and he was political scientist enough to know 
that governments and the economy were confronted with challenges to their 
legitimacy when they allowed such destruction and its accompanying depres- 
sions. He thought the remarkable institutions facilitating competition between 
species of organizations were extraordinarily precarious. 

But I think the definitive analysis of what these institutions were is John R 
Commons's (1974 [1924]), especially his careful definition of the legal protec- 
tion of competition (1974, pp. 83-134). Commons starts his analysis of what 
legitimate competition means for capitalism by distinguishing the legal defense 


^ At least Schumpeter preferred peaceful destruction—he knew that organizations and institu- 
tions were often created and destroyed by conquest, but he thought that was bad for economic 
development As does Gambetta (1993) 
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of “working rules" of firms in competition? that are distinctive to a given corpo- 
ration. These are defended legally in the sense that only the Board of Directors 
of a corporation can allocate the property of the corporation or permit access 
on the shop floor, and even stockholders cannot take away a lathe or four days 
of a firm electrician to satisfy their property interest. Interference with working 
rules invades the legally defended autonomy of a competitive corporation, the 
autonomy to have distinctive rules that allow them to do something innovative 
that can routinely give a monopolistic advantage in a niche in a market. When 
this 1s combined with an opportunity in the market (Commons 1974 [1924], 
pp. 153—157), it produces a "going concern value," and the institutional pro- 
tection of economic opportunities distinctive to capitalism is the protection of 
that going concern value (Commons 1974:172-213). 

What Commons (especially when combined with Schumpeter) managed to 
do, then, was to jump over the institutionalism of organizational ecology of 
one population (developed especially by Adam Smith for firms, and theorized 
differently by Hannan & Freeman 1977, 1989 recently) directly to the establish- 
ment of an evolutionary ecology of multiple competing species (defined by their 
working rules and by the opportunities—or niches—those exploit). Commons 
was an institutionalist and thought these were defended in the legal institutions 
of capitalism. When hooked together they explain also why Tobin’s q-ratio 
of the market value of corporations to the replacement value of their assets is 
ordinarily greater than 1 in economic booms (Tobin & Brainard 1977), why 
there are business depressions in which that ratio goes down in populations be- 
ing eliminated, and why both boom and, more surprisingly, bust are defended 
by institutions. There is no place for institutions to abolish organizations on a 
large scale in modern organizational ecology—they only legitimate them. 

Darwinian evolution does not need institutions defending competition either 
within or between species. Humans defend themselves socially from competi- 
tion by tariff barriers, racial prejudice, conspiracies in restraint of trade, guild 
regulations, patent offices, professional societies and licenses to practice, trade 
unions, zoning regulations, protection rackets, immigration and naturalization 
services, conquest of competitors, and the genocide of indigenous peoples. 

In human history, competition has not ordinarily been legitimate. It 1s a 
wonder that modern organizational ecologists have not noticed this. Most insti- 
tutional conditions restrict competition. Institutions that allow people's liveli- 
hoods and capital to be destroyed by competition are rare. Schumpeter and 
Commons knew that and considered the legitimacy of "exposure" to other peo- 
ple's "liberties" the main thing to be explained. Capitalism, particularly its 
legitimate competition and creative destruction part, is not at all natural but is 


5See also essentially the same concept in "routines" in Nelson & Winter 1982 
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instead an institutional creation. To use legitimacy to explain only the upswing 
of organizational growth rates, and to forget that the legitimacy of intensified 
competition by firms with innovations that destroys large legitimate popula- 
tions, would have seemed a peculiar blindness in the days of Commons and 
Schumpeter, when Smoot-Hawley tariffs and immigration quotas were in the 
land. The precarious legitimacy of maquiladoras over the border in Mexico 
might teach us again that competition itself is still not very legitimate. 


That Socialist Institutions Do Not Work Does Not Mean 
Capitalist Ones Do 


In many East European countries the populations have decided that they have 
tried capitalism and it does not work, so they are voting their old communist 
leaders back in. In the old institutionalism, the failure of capitalism to work 
in southern Italy was also argued by Edward Banfield, in The Moral Basis of 
a Backward Society (1967 [1958]). The basic idea of that book was that some 
sorts of institutions undermine capitalist organizations, and that they do so by 
failing to provide integrity 1n the achievement of public goods (law and order— 
especially civil law, city organization, roads, or economic development offices 
facilitating factory and firm foundation). 

Banfield’s basic notion was that if the nuclear family was so set up that its 
solidarity and interests invariably overrode those of other institutions, then those 
other institutions could not do their job. Institutions that depend on generosity 
of spirit and attention to collective welfare are especially vulnerable. Part of 
institutions’ job was legitimating the fulfillment of occupational and leadership 
obligations in firms and in economically significant government operations. 
Large-scale corruption then in arenas requiring integrity in the pursuit of public 
goods is an outcome of a particular way of institutionalizing families. The 
Mafia and Cosa Nostra are in a certain sense not institutions because they are 
in the business of corrupting whatever institutions get in the way of short-run 
maximization of nuclear family interests (on the Sicilian Mafia, see especially 
Gambetta 1993). 

The basic point can be illustrated if we consider the role of accreditation 
associations in education, the field where Meyer & Rowan (1977, pp. 354—356, 
citing their 1975 paper) first observed the powerful role of institutions. Accred- 
itation societies do not have many formal powers, nor resources to distribute; 
they have only moral powers. One of the consequences is that they very gen- 
erally ask for volunteers from several educational organizations to go inspect 
another one. Now imagine a South Italian or Sicilian social organization as 
described by Banfield sending out unpaid volunteers and making them hold to 
educational standards of the sort Meyer wants to explain, but without making 
an offer they can't refuse. Paying off the accreditors would surely make the 
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formal report into even more cant and ritual than Meyer observes. A phrase 
from Teamsters’ leader Jimmy Hoffa in a trade union debate over political ac- 
tion occurs to me: “There are two kinds of political action. You can make 
speeches, or you can give money. We give money.” If schools gave money for 
accreditation, would the rituals of formal organization be convincing? 

The extension of Banfield’s institutional argument to capitalism can most eas- 
ily be seen by calling on a really old fashioned institutionalist, Emile Durkheim 
(1933 [1893]). Durkheim held that the division of labor rested on the noncon- 
tractual elements of contract, the commitment to values of commercial honesty, 
nonstrategic use of bankruptcy, advertising with some information value, com- 
petence in one’s occupation, and the like. Banfield’s argument then might be 
read as asserting that both the contracts between firms, and between govern- 
ments and firms, are not institutionalized under “amoral familism.” 

Now back to the attempt to introduce capitalism in Eastern Europe. The 
language describing Sicily is often used to describe the economic arrangements 
in capitalist Russia; the “mafia” in Russia does not have to be imported from 
Southern Italy. The legitimation of institutional relations under socialism was 
by way of the purposes of government, as manifested in government measures 
of outcome, government subsidies of production that was considered a public 
good, and the like. The contract between a firm and the government was hardly 
deeply theologically grounded, but it had noncontractual elements in the sense 
that other parts of the government would back up the corporatist industry glavk 
and help enforce conformity on the factory manager. 

This central planning legitimation system for supply, quality control, mea- 
surement of productivity, labor relations regulation, pricing and payment for 
delivery of finished or semifinished goods, and so on, no longer legitimates. 
Calvinism giving supernatural sanctions for commercial integrity seems to be 
thin on the ground. The capacity of the population to create public goods, such 
as industry standards-setting, credit extension and its credit-rating system, hon- 
est brokerage in stock and bond markets, is crippled because that capacity used 
to be all embedded in the central planning system. The lack of intermediate 
corporate bodies with legitimacy of their own based on the expectation they will 
live up to their own moral system then produces a condition like that Banfield 
called amoral familism. And that is not good for capitalism in Russia or other 
East European countries, any more than it was good for capitalism in Southern 
Italy. 


Ring Out the New, Ring In the Old 

The basic postulate here is that organizations that work well do so by paying 
people to serve values, to try to be competent, to conduct their business with 
integrity. It is easy to exaggerate the incentive effect of the payment, and to treat 
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it as 1f it were only a game-theoretical token. But payment is an institutional 
tool as well, for we pay for things we believe in. I would argue that paying 
for things we believe in 1s the core of what an institution is. We accept tokens 
of credit rather than money incentives ın algebra classes, and pay people to 
teach them, because we believe in algebra; the teacher does not get to cash in 
the A's he or she does not give out as in game theory, and the student does 
not get to cash them in at the store. Their only value is that everyone believes 
that there was some real mathematics behind them. Even new institutionalists 
at Stanford believe in algebra and are willing to pay for it, and they do not 
want to give that money to a school that does not believe in algebra. Nor do 
people want to give money to insurance companies that won't pay when they 
die, nor do they want to buy portable hard disk drives from a company that 
doesn't answer its technical help line,? nor to trust most of the capital markets 
in Eastern Europe. The combination of resources and believable commitment 
can only be created, so the old institutionalists argued, if people believe that 
the institutional enforcers themselves believe the values. 

Only with that assumption, for example, can we explain why capitalism can 
first legitimate the horse-and-buggy industry and then legitimate the automobile 
industry that destroys it, without changing the basic institutions of capitalism. 
And it takes the failure of the assumption that enforcers of commercial contracts 
believe in honest dealing to see why the Russians and Ukrainians know so 
well that capitalism does not work: the institutional enforcers do not believe 
that commercial honesty is possible, so they do not bother to create it. The 
assumption of transaction cost analysis is that one can go out on the market, 
pay all the transaction costs, and (sometimes) get better performance than one 
could produce oneself. It is only the contrast of that market contract with 
what the firm can do better itself that makes the boundary around the firm 
or hierarchy an equilibrium in Coase's sense. But one cannot explain why 
there is any there there unless one is willing to believe that transaction partners 
will deliver the goods, will maintain the competence and responsiveness that 
makes the supplier's work superior to what the firm can do itself, at least until 
the contract is fulfilled. One pays the transaction costs by contracting with a 
supplier with competence superior to one's own. Otherwise the equilibrium 
firm size is the whole economy, perhaps as in the old USSR. 

In short, the trouble with the new institutionalism is that it does not have 
the guts of institutions in it. The guts of institutions is that somebody some- 
where really cares to hold an organization to the standards and 1s often paid to 
do that. Sometimes that somebody is inside the organization, maintaining its 


6My advice 1s to call the technical support number before buying—lomega has a wonderful 
computerized phone system for giving a client 1n trouble the runaround., 
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competence. Sometimes it is in an accrediting body, sending out volunteers 
to see if there is really any algebra in the algebra course. And sometimes that 
somebody, or his or her commitment, is lacking, in which case the center cannot 
hold, and mere anarchy is loosed upon the world. 
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ABSTRACT 

The savings and loan crisis of the 1980s was one of the worst financial disas- 
ters of the twentieth century. We argue here that much financial fraud of the sort 
that contributed to this debacle constitutes "collective embezzlement," and that 
this collective embezzlement may be the prototypical corporate crime of the late 
twentieth century. We further argue that the state may have a different relation- 
ship to this kind of financial fraud than to manufacturing crime perpetrated on 
behalf of corporate profits. In the conclusion, we suggest that an understanding 
of the relationship between financial fraud and state interests may open up new 
regulatory space for the control of these costly crimes. Our data come from a wide 
variety of sources, including government documents, primary statistical data on 
prosecutions, and interviews with regulators. 


INTRODUCTION 


The savings and loan (S&L) crisis of the 1980s was one of the worst financial 
disasters of the twentieth century. Experts gauge its cost to US taxpayers over 30 
years to be approximately $500 billion, including interest payments on the gov- 


ernment bonds sold to finance the industry's bailout (US General Accounting 
Office, cited 1n Johnston 1990:1; National Commission on Financial Institution 


Reform, Recovery and Enforcement [NCFIRRE] 1993:4). Reliable estimates 
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of the percentage of losses due to criminal wrongdoing range from 10% to 
44% (Office of Thrift Supervision 1990, Resolution Trust Corporation 1992, 
Akerloff & Romer 1993, NCFIRRE 1993). 

The savings and loan scandal was by no means an isolated phenomenon in the 
high-flying 1980s. Insurance fraud, junk bond manipulation, insider trading, 
and securities fraud took their place alongside the thrift debacle in the headlines 
and on the nightly news. By the end of the decade, Charles Keating, Michael 
Milken, and Ivan Boesky had become household words, their names synony- 
mous with corporate wrongdoing. Besides this ill-gotten notoriety, and the vast 
wealth accruing from their legitimate and illegitimate activities, these offenders 
had something else in common. Unlike much of the corporate crime of earlier 
decades addressed by so much of the scholarly literature—for example, the elec- 
trical company conspiracy, the Pinto scandal, unsafe practices in the pharmaceu- 
tical industry, and occupational safety and health violations (Geis 1967, Cullen 
etal 1987, Braithwaite 1984, Carson 1981)—their crimes had nothing to do with 
production or manufacturing but instead entailed the manipulation of money. 

In this case study of the savings and loan disaster, we argue 1. that much fi- 
nancial fraud of this sort constitutes a specific form of corporate crime, what we 
call “collective embezzlement;" 2. that this collective embezzlement may be the 
prototypical corporate crime of the late twentieth century in the United States, 
in that it corresponds to the shift away from manufacturing or industrial capital- 
ism to finance capitalism; 3. that the state may have a different relationship to 
collective embezzlement than to manufacturing crime perpetrated on behalf of — 
corporate profits; and, 4. that by examining this change in the nature of corpo- 
rate crime, and exploring the state’s reaction to these offenses, we may be able to 
delineate with more precision the relationship between the state and economic 
activity and more generally to advance state theory. In the conclusion, we sug- 
gest that an understanding of the relationship between financial fraud and state 
interests may open up new regulatory space for the control of these costly crimes. 

The data for this paper come from a wide variety of sources. In addition 
to secondary data such as government documents, regulators’ reports, and 
other published accounts of the S&L crisis, we have gathered two kinds of 
primary data. The first consists of interviews with 105 government officials 
who were involved in policymaking, regulation, prosecution, and/or enforce- 
ment, in Washington, DC, and in field offices in California, Texas, and Florida, 
where the bulk of thrift! fraud took place. These unstructured, open-ended 


lWe use "savings and loans"and “thrifts” interchangeably here to refer to federally insured 
savings institutions that have as their primary historical function the provision of home mortgage 
loans. See Calavita & Pontell (1990, 1991, 1993) for a discussion of the regulatory changes in the 
early 1980s that freed these institutions to make direct investments, that effectively altered their 
function as home-mortgage lenders, and that provided extensive opportunities for fraud. 
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interviews provide substantial background information on techniques of fraud 
and the state response. Second, we have assembled several statistical data sets 
on the nature of thrift crime and the government's prosecution effort. These 
data were compiled from the statistics provided us by three federal agencies: 
the Resolution Trust Corporation (RTC, the federal agency created to man- 
age and sell assets from seized thrifts); the Office of Thrift Supervision (OTS, 
which after 1989 took over from the old Federal Home Loan Bank Board the 
task of examination and enforcement); and the Executive Office of the US 
Attorneys (which made prosecutorial decisions). These statistical data pro- 
vide detailed information on criminal referrals, indictments, convictions, and 
sentencing. 

Of course the picture is far from complete and our data are necessarily im- 
perfect. For one thing, we can never be sure how much crime goes undetected, 
nor whether the crimes that do come to light are representative of thrift crime 
in general. While this is a concern in all criminological research, it is partic- 
ularly acute in the area of white-collar crime where fraud is often disguised 
within ordinary business transactions (Katz 1979), proving fraudulent intent is 
problematic, and potentially criminal wrongdoing consequently is not prose- 
cuted. Using the "criminal referral" as our indicator of fraud goes partway in 
addressing this “front end” problem in white-collar crime research. 

Criminal referrals, usually filed by examiners or regulators or by whistle- 
blowers at the institution itself, describe suspected crimes, name individuals 
who may have committed crimes, and estimate dollar losses. These referrals 
are sent through the regulatory field office to the FBI and the US Attorney’s 
Office for investigation, where in a relatively few cases they result in indict- 
ments. The accuracy of these referrals as a measure of thrift crime—like the 
accuracy of statistics on “crimes known to the police” as an indicator of street 
crime—is limited by the existence of a “dark figure of crime,” that is, the sig- 
nificant number of crimes that are never detected or reported. We use criminal 
referrals here to gain some sense of the incidence and patterns of thrift crime, 
with the understanding that they do not cover the entire population of fraud.” 

The following section draws on these data to describe the basic forms of 
thrift fraud. As we will see, despite their apparent complexity, many of these 


7There may be some cases in which referrals were filed in the absence of actual criminal 
misconduct. However, our conversations with regulators, investigators, and prosecutors, and our 
perusal of the detailed referrals to which we gained access, suggest that these forms were by no 
means filed frivolously Because of the amount of information required on these forms, and because 
the reputation of the agency that filed a referral was at stake, they were likely to be limited to the 
most egregious cases of suspected misconduct, particularly as caseloads bogged down in the late 
1980s See Calavita et al (1997) for a description of the triage-like prosecutorial decisions that 
resulted in indictments in only one case out of every seven I referrals 1n Texas, and one 1n 
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frauds were variations on a fixed number of themes. As one of our informants 
put it, “It was as 1f someone had found a cookie cutter” (personal interview). 


TYPES OF S&L FRAUD: HOT DEALS, LOOTING, 
AND COVERING UP 


Corporate crimes of the sort involved in the savings and loan scandal are among 
the most complex white-collar offenses ever committed, leading one journalist 
to complain that when "regular people" tried to figure it out, they got “hopelessly 
bored and confused, as though they'd fallen a month behind in their high- 
school algebra class" (O' Rourke 1989:43). The concerned layperson is helped 
considerably by the fact that there are a limited number of basic formulas for 
abusing thrifts. Before describing these basic formulas, we need to note that 
our discussion is focused on insider abuse, by which we mean those frauds in 
which thrift owners and/or managers were central players, either alone or in 
collaboration with outsiders. 

While some have implied that the industry's problems in the 1980s were 
brought on by outsiders who victimized thrifts (Lewis 1989, Lowy 1991), the 
schemes at the heart of the scandal required the participation of insiders. Our 
RTC data,’ which allow us to identify the organizational positions of those cited 
in criminal referrals, confirm that high-level insiders were participants in the 
majority of these frauds. Of the 2265 criminal referrals in this data set, insiders 
were listed as suspects in 1294 or 6396. Far from a case of naive insiders 
being victimized by slick con artists, most thrift fraud at these institutions was 
self-inflicted. 

At the most general level, we can decipher three types of insider abuse, 
called here hot deals, looting, and covering up. All involved one or more viola- 
tions of federal bank fraud statutes, including prohibitions against “kickbacks 
and bribes” (18 USC Section 215); “theft, embezzlement, or misapplication of 
funds” (18 USC Sections 656,961c); “schemes or artifices to defraud federally 
insured institutions” (18 USC Section 1344); “knowingly or wilfully falsify- 
ing or concealing material facts or making false statements” (18 USC Section 
1001); “false entries in bank documents with intent to injure or defraud bank 
regulators, examiners...” (18 USC Section 1005); and/or “aiding and abetting 
and conspiracy” (18 USC Sections 2, 371). The following sections describe 


3 As part of its mandate to manage the assets of seized thrifts, the RTC collected data on criminal 
referrals through its Thrift Information Management System (TIMS). This system recorded and 
tracked criminal referrals at all RTC mstitutions (that is, those institutions that had been declared 
insolvent and taken over by the RTC) The TIMS data referred to here summarize the contents of 
“Category 1” (suspected thrift fraud of $100,000 or more) criminal referrals filed by the RTC or 
other federal agencies through May, 1992 
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hot deals, looting, and covering up, and outline their contribution to the thrift 
debacle. 


Hot Deals 


These investment frauds were the sine qua non of insider abuse, providing both 
the cash flow from which to siphon off funds and the transactional medium 
within which to disguise it. Four kinds of transactions were central in these 
deals. As Senate Banking Committee staff members Alt & Siglin (1990:3—5) 
explain, these transactions include such schemes as land flips, nominee loans, 
reciprocal lending, and linked financing. Land flips involve selling a property 
back and forth among two or more partners, inflating the price each time and 
refinancing the property with each sale until the value has increased many times 
over. Alt & Siglin (1990:4) use the following example: 


A sells a parcel of real estate to B for $1 million, its approximate market value B finances 
the sale with a bank loan .. .. B sells the property back to A for $2 million. A finances the 
sale with a bank loan, with the bank relying on a fraudulent appraisal. B repays his original 
loan and takes $1 million in ‘profit’ off the table, which he shares with A. A defaults on 
the loan, leaving the bank with a $1 million loss. 


While the flip technically could be achieved without the lending institution's 
participation, thrift insiders were often collaborators. For one thing, insiders 
were sometimes business associates of the corrupt borrowers, who at a future 
date would exchange the favor. In addition, while the lending institution was 
left with an overpriced property on its hands, in the short-term, it made upfront 
points and fees from the huge loans, from which executive bonuses could be 
drawn. 

It was not unusual in the mid-1980s for partners to sit down and in one af- 
ternoon flip a property until its price was double or triple its original market 
value, refinancing with each flip. The playful jargon for this was “cash-for- 
trash,” and the loans were “drag-away loans,“ because the intention from the 
beginning was to default, dragging away the proceeds. Don Dixon at Vernon 
Savings and Loan in Texas (nicknamed by regulators *Vermin") reportedly 
“flipped land deals [like] pancakes” (O'Shea 1991:76). He and associates like 
(fast Eddie) McBirney at Sunbelt Savings and Loan in Dallas (nicknamed by 
regulators "Gunbelt"), and Texas developer Danny Faulkner used land flips 
among other schemes to develop hundreds of miles of condominiums on I-30 
northeast of Dallas. Officers at nearby Empire Savings and Loan financed the 
land flips to inflate the value of the land and provide the rationale for making the 
condo development loans despite an already glutted Dallas real estate market 
(personal interviews). Faulkner and his partners hosted weekend brunches at 
Wise’s Circle Grill in the I-30 corridor. Invited guests included officials from 
Empire Savings and Loan, investors, appraisers, and increasingly, politicians 
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who drew huge campaign contributions from the events. Over breakfast, prop- 
erties quickly changed hands and millions of dollars of phony profits were made 
in a few hours (O’Shea 1991:32; personal interviews). 

Nominee loans were often used in conjunction with land flips. Nominee lend- 
ing used a straw borrower to circumvent loan-to-one-borrower regulations or 
restrictions on insider borrowing. One costly nominee loan partnership involved 
former dentist Duayne Christensen and real estate broker Janet McKinzie in 
Santa Ana, California. Christensen opened North American Savings and Loan 
in 1983. In partnership with McKinzie, Christensen used the thrift to make 
loans to his own real estate projects and to participate in multiple land flips, 
through a company owned and controlled by McKinzie. In one typical scheme, 
straw borrower David Morgan purchased property brokered by McKinzie's real 
estate company and financed by North American. He then resold the property 
at an inflated value to his own holding company. Fees and commissions poured 
into McKinzie's real estate brokerage. North American made upfront points 
and fees but apparently never saw the proceeds from the resale of any of the 
properties (US Congress, House Committee on Government Operations, Sub- 
committee on Commerce, Consumer, and Monetary Affairs 1987a:304-9). The 
thrift was closed in 1988, at a loss of $209 million. 

Reciprocal lending was another way of circumventing restrictions against 
insider borrowing. Instead of making a loan directly to oneself, which would 
have sounded the alarm among regulators, two or more insiders at different 
thrifts made loans to each other. Making loans to each other is not in itself 
illegal, but making loans contingent on a reciprocal Joan is fraud (Alt & Siglin 
1990:5). These "daisy chains" often involved multiple participants, and unrav- 
eling them sometimes took investigators far afield of the original institution and 
exposed the complex conspiratorial quality of thrift fraud. One investigation in 
Wyoming in 1987 revealed a single daisy chain of reciprocal loans among four 
thrifts that by itself resulted in a $26 million loss to taxpayers (US Congress, 
House Committee on Government Operations, Subcommittee on Commerce, 
Consumer, and Monetary Affairs 1987b:79-80, 129-130). A Texas network in- 
cluded at least 74 daisy chain participants and involved all the insolvent thrifts 
in the state (US Congress, House Committee on Banking, Finance and Urban 
Affairs 1990:799-872). 

Similar in logic to reciprocal lending arrangements, linked financing in- 
volves depositing money in a thrift, with the understanding that the depositor 
will receive a loan in return. Loan broker Mario Renda specialized in setting up 
linked financing deals, even advertising in the Wall Street Journal, the New York 
Times, and the Los Angeles Times: “MONEY FOR RENT: BORROWING OBSTA- 
CLES NBUTRALIZED BY HAVING US DEPOSIT FUNDS WITH YOUR LOCAL BANK: 
NEW TURNSTYLE APPROACH TO FINANCING" (quoted in Pizzo et al 1991:127). 
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Renda placed large brokered deposits in thrifts, for which he received a finder’s 
fee, in return for which borrowers with credit “obstacles” received a generous 
loan from the thrift. Response to Renda’s ads was overwhelming; according to 
his own court testimony, Renda brokered the deposits for hundreds of linked 
financing schemes. 

Investigators and regulators report finding variations of land flips, nominee 
loans, linked financing, and reciprocal lending arrangements over and over in 
their autopsies of seized thrifts. Many hot deals combined elements of all four 
transaction frauds. So-called “ADC” (acquisition, development and construc- 
tion) lending is a good example. ADC lending was a form of direct investment 
by thrifts in the 1980s, in which thrifts provided up to 100% of the financing 
for a speculative development project, reaping proceeds (and collecting on the 
loan) only if the development turned a profit. Points, fees, and several years of 
interest payments were often included in the loan, so that it appeared on paper to 
be in good standing, and the thrift recorded income from the self-funded points 
and fees. Fraudulent ADC lending was at the core of the S&L debacle in Texas 
and helps explain continued investment in Texas commercial real estate when 
the market was already glutted. As the National Commission on Financial In- 
stitution Reform, Recovery and Enforcement (1993:48) explains, “ADC loans 
were an attractive vehicle for abuse. They bound up in one instrument many of 
the opportunities available.” A land flip might provide inflated collateral for a 
generous ADC loan; the loan then might be to a straw borrower who shared the 
proceeds with thrift insiders (nominee lending); thrift managers might make 
the loan contingent on a deposit or other investment (linked financing); and/or, 
thrift insiders might exchange loans to finance development projects with each 
other (reciprocal lending arrangements). As Texas S&Ls were pumped up with 
these huge development loans, they reported unprecedented paper profits from 
which thrift insiders extracted generous bonuses and dividends—one major 


f f “looting.” i 
ot ni PREFERENCE 
Looting 


Looting refers to the siphoning off of funds by thrift insiders and is thus more 
like traditional forms of crime than are the business transactions involved in hot 
deals. Because thrift management was doing the looting, it took different forms 
from a typical bank robbery or embezzlement by a lower-level employee. The 
most straightforward, but probably least common, way to loot was simply to 
remove deposits from the thrift and stash them away. Probably more common 
were shopping sprees with thrift funds, and excessive bonuses or other forms 
of compensation. 

David Paul bought CenTrust Savings Bank in Miami in 1983 and quickly 
turned it into a megathrift with $9.8 billion in assets (Pizzo et al 1991:404—5; 
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Lowy 1991:152—53). Paul used over $40 million of CenTrust money for a 
yacht, a Rubens painting, a sailboat, Limoges china, and Baccarat crystal (Lowy 
1991:152). In addition, he built the 47-floor Miami CenTrust skyscraper at a 
cost of $170 million (Mayer 1990:77; Pizzo et al 1991:405; Lowy 1991:152~ 
53). When CenTrust collapsed, it was the largest S&L failure in the Southeast, 
costing taxpayers $1.7 billion. For his part in the insolvency, David Paul was 
convicted of 97 counts of racketeering and fraud and sentenced to eleven years 
in prison (Los Angeles Times 1994:D2). 

Excessive compensation schemes were another way for insiders to loot their 
institutions. The US General Accounting Office (GAO)(1989:21) defines "com- 
pensation" as "salaries as well as bonuses, dividend payments, and perquisites 
for executives." A federal regulation limits permissible compensation for thrift 
personnel to that which is "reasonable and commensurate with their duties and 
responsibilities" (GAO:21). At the most expensive failures, executives typi- 
cally paid themselves exorbitant bonuses and dividends, even as their thrifts 
were collapsing (GÀO 1989:21). For example, Don Dixon and his top execu- 
tives at Vernon took more than $15 million in bonuses between 1982 and 1986, 
at a time when the thrift was already deeply insolvent (O' Shea 1991:217—18). 
During the six years that David Paul was driving CenTrust into the ground, he 
paid himself $16 million in salary and bonuses, $5 million of it coming in 1988 
and 1989 when the thrift was piling up losses from its junk bond investments 
(Pizzo et al 1991:406). 

Of the 26 most costly failed thrifts studied by the US General Accounting 
Office (GAO)(1989:21), shopping sprees and excessive compensation had oc- 
curred in the vast majority. A large proportion of prosecutions have been for 
such looting, probably because of the relative ease of building a convincing 
body of evidence for these more straightforward frauds compared to the com- 
plex business transactions involved in hot deals (personal interviews). In any 
case, hot deals and insider looting went hand-in-hand: The deals provided the 
cash flow and reported income with which to finance shopping sprees and exces- 
sive compensation—indeed the ability to siphon off phony “profits” provided 
the incentive for hot deals and the rapid growth they fueled. 


Covering Up 

As savings and loans teetered on the brink of insolvency, broken by hot deals 
and looting, their operators struggled to hide both the insolvency and the fraud. 
In some cases, the cover-up came in the form of deals whose primary purpose 
was to produce a misleading picture of the institution's state of health. US 
Attorney Anton R Valukas describes a number of such cover-up deals: 


In the prosecuted cases of Manning Savings and Loan, American Heritage Savings and 
Loan of Bloomingdale and First Suburban Bank of Maywood, when the [nominee] loans 
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became non-performing the assets were taken back into the institution, again sold at inflated 
prices to straw purchasers, financed by the institution, 1n order to inflate the net worth of 
the bank or savings and loan. The clear purpose was to keep the federal regulatory agencies 
. at bay by maintaining a net worth above the trigger point for forced reorganization 
or liquidation" (quoted 1n US Congress, House Committee on Government Operations, 
Subcommittee on Commerce, Consumer, and Monetary Affairs 1987b:99—100). 


Insiders also could simply doctor their books to shield their thrift from reg- 
ulatory action. At one S&L studied by the GAO (1989:41), three sets of 
books were kept—two on different computer systems and one manually. At 
another, $21 million of income was reported in the last few days of 1985 in 
transactions that were either fabricated or fraudulent, allowing the thrift to 
report a net worth of $9 million, rather than its actual negative $12 million 
(GAO 1989:44—45). McKinzie and Christensen at North America S&L pre- 
pared bogus documents when challenged by regulators. When the thrift was 
finally taken over, examiners found evidence of fake certificates of deposit, 
forged bank confirmation letters, and other cover-up materials (US Congress, 
House Committee on Government Operations, Subcommittee on Commerce, 
Consumer, and Monetary Affairs 1987a:308). 


Relative Frequency of Hot Deals, Looting, and Covering Up: 
The Dallas OTS Files 


For a more detailed picture of the part these insider frauds played in the thrift 
crisis, we can look at a subsample of Category 1 criminal referrals in one state, 
Texas. Here we rely on data from files in the Dallas Office of Thrift Supervision 
(OTS). This OTS office maintained computer files on all criminal referrals 
for thrift fraud in Texas. We selected a 20% sample from this list of 1210 
criminal referrals filed between January 1985 and March 1993, by choosing 
every fifth referral. We then examined the actual referral forms for each of the 
241 cases in our sample, as well as the numerous supporting documents that 
accompanied these referrals. For each case, we coded the type of suspected 
violation, whether insiders were involved, and the estimated loss from the 
suspected crime. 

Our main objective was to obtain a better sense of the crimes being reported at 
thrift institutions. Based on a careful perusal of these referral files, we developed 
11 specific categories of insider fraud (including, for example, insider loans, 
land flips, kickbacks, falsification of documents, etc). While many of these 
categories overlapped in practice, since these frauds were usually complex and 
often contained several layers of deception, we made distinctions according to 
what the primary offense or central ingredient of the suspected misconduct was. 
We then sorted these offenses according to whether they constituted hot deals, 
looting, covering up, or some other broad category of fraud. For example, 
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insider loans and land flips are hot deals, kickbacks are a form of looting, and 
falsification of documents is a method of cover-up. 

Of the 241 cases of suspected misconduct, 193 involved insider fraud, as 
opposed to victimization by outsiders, which is consistent with the RTC data 
discussed above. More important here, all but three of the 193 insider fraud 
cases involved some form of hot deals, looting, or covering up. The most com- 
mon form of fraud was hot deals, with various types of such deals comprising 
almost 68% of these suspected frauds and contributing by far the highest price 
tag—over $1.08 billion out of the total $1.6 billion in estimated losses. 


Hot Deals and Looting as “Collective Embezzlement” 


In discussing different forms of white-collar crime, Sutherland (1983:231) de- 
scribed embezzlement: “The ordinary case of embezzlement is a crime by a sin- 
gle individual in a subordinate position against a strong corporation.” Cressey, 
in Other People’s Money (1953), focused on the lone white-collar embezzler, 
stealing from his or her employer. Traditionally, then, embezzlement has been 
thought of as an isolated act of an individual employee who steals from the 
corporation for personal gain. 

Criminologists have typically drawn a sharp distinction between this “em- 
bezzlement" by individuals against the corporation and “corporate crime,” in 
which fraud is perpetrated by the corporation on behalf of the corporation. For 
example, Wheeler & Rothman (1982:1405) speak of two distinct types of whité- 
collar crime: “Hither the individual gains at the organization’s expense, as in 
embezzlement, or the organization profits regardless of individual advantage, 
as in price-fixing.” Similarly, Coleman (1987:407) argues, “The distinction be- 
tween organizational crimes committed with support from an organization that 
is, at least in part, furthering its own ends, and occupational crimes committed 
for the benefit of individual criminals without organizational support, provides 
an especially powerful way of classifying different kinds of white-collar crime.” 

Neglected in this dichotomy is the possibility of organizational crime in 
which the organization is a vehicle for perpetrating crime against itself, as 
in the hot deals and looting described here. This form of white-collar crime 
represents a hybrid between traditional corporate crime and embezzlement— 
crime by the corporation against the corporation—and might be thought of 
as “collective embezzlement.” Unlike Sutherland’s and Cressey's embezzlers, 
these “collective embezzlers” were not lone, lower-level employees but thrift 
owners and managers, acting in networks of coconspirators inside and outside 
the institution. Indeed, this embezzlement was often company policy. In some 
cases, it was the very purpose of the organization to provide a vehicle for 
fraud against itself. Wheeler & Rothman (1982:1406) have pointed to “the 
organization as weapon" in white-collar crime: “[T]he organization ... is for 
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white-collar criminals what the gun or knife is for the common criminal—a 
tool to obtain money from victims.” In the collective embezzlement of the 
thrift industry, the organization was both weapon and victim. 


COLLECTIVE EMBEZZLEMENT AND LATE 
TWENTIETH CENTURY CAPITALISM 


As we approach the twenty-first century, the US economy—and those of other 
advanced capitalist countries—is compared to a “casino” (Bates 1989:D1, Busi- 
ness Week 1985:78—90). In this casino economy, the largest profits are made 
from placing a clever bet, not making a better mousetrap. Unlike in industrial 
capitalism where goods and services are produced to make a profit, in finance 
capitalism—where the “means of production" are currency trades, corporate 
takeovers, loan swaps, and futures trading— profits come from “fiddling with 
money" (Trillin 1989). Nobel Prize winner Maurice Allais underscores the 
magnitude of this shift from an economy based on the circulation of goods 
to one circulating money itself, pointing out that “more than $400 billion is 
exchanged every day on the foreign exchange markets, while the flow of com- 
mercial transactions is only about $12 billion" (quoted in Bates 1989:D1). 

Beginning in 1980, the futures market, in which investors gamble on the 
future price of hogs, grain, tobacco, and other commodities, rapidly outpaced 
the production of the goods whose prices were being bet on. In 1983, US futures 
trading totaled $7 trillion, or $28 billion a day, and by 1985, futures trading 
was growing at a rate ten times the rate of industrial production (Harrison 
& Bluestone 1988:54). The US financial market was so active that in 1984, 
over $4 trillion in trades came through a single investment banking firm—First 
Boston Corp.—more than equaling the national GNP for that year (Plotkin & 
Scheuerman 1994:59), 

As the management of money and financial speculation outstrip production 
as the greatest sources of profit, this finance capitalism spawns vast new oppor- 
tunities for fraud such as the hot deals and looting discussed above. Further, 
there are relatively few constraints or risks associated with these lucrative op- 
portunities. Unlike corporate criminals in the industrial sector who generally 
commit crime to advance corporate profits and are constrained by concern for 
their corporation's long-term survival, collective embezzlers in the casino econ- 
omy have little to lose. With no long-term investment in the infrastructure of 
production, their main concern is to get in and out of the house with as much of 
the pot as possible. The effect of their crimes on the health of the casino, or even 
its long-term survival, are unimportant to these financial high-flyers. Business 
Week (1985:90) describes "all the games the casino society plays": “The object 
... is to get rich today, come what may.” Collective embezzlement, in which 
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highly placed insiders loot their own institutions, may be the prototypical form 
of white-collar crime in this context, much as violations of fair labor standards 
or consumer protections are to the industrial production process. 


COLLECTIVE EMBEZZLEMENT AND THE STATE 


Sociologists have long made a distinction between "social" regulations (such 
as occupational safety and health standards), which are aimed at controlling 
production processes, and “economic” regulations (such as insider trading re- 
strictions), which regulate the market and stabilize the economy (Barnett 1981, 
Cranston 1982, Snider 1991, Yeager 1991). While the former protect workers 
and consumers against the excesses of capital—and tend to cut into profits—the 
latter regulate and stabilize the capital accumulation process and historically 
have been supported by affected industries. 

This distinction is consistent with a structural approach to the state, which 
emphasizes the “objective relation" (Poulantzas 1969) between the state and 
capital (see also Althusser 1971, O'Connor 1973). This objective relation 
guarantees that the capitalist state will operate in the long-term interests of 
capitalists independent of their direct participation in the policymaking process 
or mobilization of resources. Central to this objective relation under capitalism, 
the state must promote capital accumulation because its own survival depends 
on tax revenues derived from successful profit-making activity, as well as the 
political stability that is contingent on economic growth. In addition, it must 
actively pursue “political integration" (Friedland et al 1977), “legitimation” 
(O'Connor 1973), or “the cohesion of the social formation" (Poulantzas 1969) 
in the interest of political survival and the economic growth upon which it 
depends. 

In this structuralist rendition, the state enjoys relative autonomy in its efforts 
to realize these potentially contradictory functions. In direct contrast to the 
instrumentalist model espoused by Domhoff (1967, 1979) and others (Kolko 
1963, 1965, Miliband 1969), structuralists argue that state managers are not 
captive to individual capitalists’ interests and indeed are capable of violating 
those interests in order to pursue the broader and more long-term interests of 
capital accumulation and political legitimacy. Nonetheless, its autonomy is 
“relative.” While the state may be free from the manipulation of individual 
capitalists or even of the business community as a whole, it is by no means 
autonomous from the structural requirements of the political economy within 
which itis embedded and which it must work to preserve (see Poulantzas 1969). 

Regulation scholars who borrow from this perspective have generally fo- 
cused on social—rather than economic—regulation. This literature addresses 
the lax enforcement of social regulations and ties that laxity to the capital 
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accumulation function of the state and the perceived costs of interfering with 
profitable industry (Barnett 1981, Calavita 1983, Snider 1991, Yeager 1991). 
These scholars also note that the legitimation mandate of the state periodically 
requires that it respond to political demands to shore up worker safety, reduce 
environmental hazards, or enforce labor standards. The point, however, is that 
active enforcement of social regulation occurs primarily in response to public 
pressure and legitimation concerns, and it recedes once political attention has 
shifted elsewhere and state legitimacy is no longer threatened (Barnett 1981, 
Carson 1982, Walters 1985, Calavita 1983, Gunningham 1987, Yeager 1991). 

In contrast, when the goal is economic regulation, the state tends to assume a 
more rigorous posture. Despite occasional protest from the individual capital- 
ists at whom sanctions are directed, the state rather vigorously enforces regu- 
lations that stabilize the market and enhance economic viability. Unlike social 
regulations that are implemented primarily in response to on-again/off-again 
legitimation needs, economic regulations are integral to the capital accumula- 
tion process and are thus more consistently and urgently pursued (Barnett 1981, 
Snider 1991, Yeager 1991). While case studies are far fewer in this area, some 
excellent research has focused on the US Securities and Exchange Commis- 
sion (SEC). As Yeager (1991) and Shapiro (1984) have shown, although the 
SEC is by no means omnipotent in the face of its powerful Wall Street charges, 
nonetheless it rather routinely seeks criminal sanctions and stiff monetary fines 
for elite offenders. 

Extensive empirical research documents this enforcement discrepancy. 
Clinard et al’s (1979) comprehensive analysis of enforcement actions against 
the 582 largest corporations in the United States during 1975 and 1976 found a 
strong relationship between level of enforcement and type of violation. While 
over 96% of “manufacturing violations” (involving social regulations concern- 
ing such things as product safety and food and drug standards) were handled 
entirely at the administrative level, only 41.5% of “trade violations” (involving 
economic regulations controlling bid-rigging and other unfair trade practices) 
were disposed of administratively. Further, while over 21% of trade violations 
were processed criminally, fewer than 1% of manufacturing violations were 
criminally processed, and no labor standard violations were prosecuted crimi- 
nally. Clinard et al (1979:147) conclude, “Corporate actions that directly harm 
the economy were more likely to receive the greater penalties, while those 
affecting consumer product quality were responded to with the least severe 
sanctions. Although over 85 percent of all sanctions were administrative in na- 
ture, those harming the economy were most likely to receive criminal penalties” 
(see also Barnett 1981). 

Based on these empirical findings and the structural theory with which they 
are consistent, the lenient treatment of corporate offenders documented by 
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Sutherland over 40 years ago and reaffirmed by subsequent white-collar crime 
scholars may be a function of the state’s relationship to capital. It may be, how- 
ever, that this documentation of leniency is related to the fact that the primary 
focus in this literature is on violations of social regulations that cut into profits. 

But, what of the collective embezzlement described above? If the structural 
logic is valid, then the state should have an altogether different relationship 
to collective embezzlement in the thrift industry than to traditional corporate 
crimes in the manufacturing sector. For one thing, the structural model would 
predict—and the empirical literature reviewed above supports this prediction— 
that the state would take violations of economic regulations quite seriously. 
Further, we would expect that enforcing banking regulations, which lie at the 
very heart of the economic system, would be among the state’s highest priorities 
and would thus be a showcase for enforcement. 

Tn addition, remember that collective embezzlement is aimed not at enhancing 
corporate profits, but at personal profit-making at the expense of the institution. 
In the S&L context, this “crime by the organization against the organization” not 
only decimated individual institutions, but threatened the demise of the whole 
industry, and with it the financial stability of the US economy. As a senior staff 
member of the Senate Banking Committee put it, “This [thrift] industry is very 
close to the heart of the American economy. We teetered on the edge of a major, 
major problem here. Well ... we got a major problem, but we teetered on the 
edge of a major collapse . . .. All these financial industries could bring down the 
whole economy" (personal interview). For the state whose functions include 
capital accumulation and long-term economic stability, we would expect that 
containing this collective embezzlement would have been a top priority. 

Instead, extensive evidence indicates that the state not only failed to avert the 
crisis but was complicitous in shielding thrift offenders from detection (Calavita 
& Pontell 1990, Calavita et al 1997, US Congress, House Committee on Stan- 
dards of Official Conduct 1989, National Commission on Financial Institutions 
Reform, Recovery and Enforcement 1993). In the remainder of this paper, we 
argue that a close look at the evolution of the thrift crisis and the role of govern- 
ment in delaying its resolution suggests the need to go beyond the reductionism 
of current state theory, toward a more synthetic model of state activity. 


PUSHING THE LIMITS OF STATE THEORY 


At first blush, the S&L debacle seems to reaffirm the instrumentalist notion of 
a state captured by monied interests in the form of campaign contributions and 
other less subtle forms of “honest graft” (Jackson 1988). Indeed, implicit or 
explicit collusion of government officials with thrift offenders seems to have 
played a significant role in thwarting regulation and thereby exacerbating the 
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crisis. At the lowest level of field inspectors and examiners—those with front- 
line responsibility for detecting and reporting fraud—there were occasional 
instances of cooptation by fraudulent thrift operators. One strategy of thrift ex- 
ecutives was to woo examiners and regulators with job offers at salaries several 
times higher than their modest government wages (Pizzo et al 1991, personal 
interviews). 

More important than these relatively infrequent forms of outright collusion 
by regulators and examiners were close connections between thrift industry ex- 
ecutives and elected officials. The powerful US League of Savings and Loans, 
the thrifts’ major lobbying group in Washington and generous donor of cam- 
paign funds, was a significant force in deflecting regulatory scrutiny, under- 
funding regulatory agencies, and generally postponing the closure of insolvent 
and fraud-ridden thrifts.* While the “Keating Five"—the five Senators who 
challenged San Francisco regulators on behalf of Charles Keating—provide 
the most visible example of Congressional attempts to rein in regulators, they 
were not alone. Congressman Fernand St Germain, Chair of the House Bank- 
ing Committee and cosponsor of the deregulatory legislation in the early 1980s 
that set the stage for the thrift crisis, was a frequent recipient of US League 
of Savings and Loans largesse. Having been observed regularly dining out in 
Washington on the US League's expense account, St Germain was investigated 
by the Department of Justice for conflict-of-interest violations. The Justice 
Department concluded that there was "substantial evidence of serious and sus- 
tained misconduct" by St Germain in his connections with the thrift industry. A 
House Ethics Committee investigation came to the same conclusion. No formal 
prosecution was initiated, and St Germain was voted out of office in 1988. He 
is currently a lobbyist for the thrift industry (Jackson 1988, Pizzo et al 1991). 

House Speaker Jim Wright was particularly adept at intervening on behalf of 
his thrift benefactors and often called Federal Home Loan Bank Board (FHLBB) 
Chair Ed Gray to task for his attempts at aggressive thrift regulation. On several 
occasions, documented in the independent counsel's report to the House Com- 
mittee investigating Wright's alleged conflicts of interest (US Congress. House 
Committee on Standards of Official Conduct 1989, see also the National Com- 
mission on Financial Institutions Reform, Recovery and Enforcement 1993), 
the Speaker of the House from Texas asked the chief regulator to back off of 
Texas thrifts and advocated on behalf of specific thrift owners who had con- 
tributed generously to his campaign fund. 

This complicity of government officials is exactly what instrumentalists 
would predict, and it sharply contradicts the structuralist notions of relative 


‘For a detailed discussion of the role of government corruption in the S&L debacle, see Calavita 
& Pontell (1994) and Calavita et al (1997) 
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autonomy and the priority placed on economic regulation and long-term finan- 
cial stability. The fact that state officials shielded the collective embezzlement 
of thrift operators at the expense of economic stability adds substantial cred- 
ibility to the instrumentalist model of the impact of raw economic power and 
influence-peddling. 

However, there is more to it than this. While it is true that the US League of 
Savings and Loans and its individual members exerted considerable influence in 
Congress, at the same time a vitriolic struggle between members of Congress 
and the FHLBB raged behind the scenes (Black 1994, Waldman 1990, US 
Congress. House Committee on Standards of Official Conduct 1989). By all 
accounts, Ed Gray and his staff at the FHLBB were stunned by the escalating 
thrift crisis in Texas and elsewhere, and they approached their assignment with 
urgency. Gray was said to have had his “Road to Damascus experience” (Black 
1994:9) watching a homemade videotape of miles of abandoned condominiums 
east of Dallas financed by the insured deposits of Empire Savings and Loan. 
He spent the rest of his tenure at FHLBB attempting to reregulate thrifts and 
encountering resistance from the industry, the White House, and Congress. One 
investigative reporter described the attack on Gray and his dogged persistence: 
“If Gray's reign as the bank board's chairman had been a fight ... they would 
have stopped it. ... He stood in the middle of the ring . . . taking repeated blows. 
But he never fell to the canvas" (Binstein 1987:48). 

This clash between regulators who were alarmed at the pending disaster 
and key members of Congress who shielded their thrift benefactors refutes not 
only the structural notions of uniform state purpose and relative autonomy, but 
instrumentalists' depiction of state actors as simply lackeys of monied interests. 
It suggests instead that relative autonomy may vary across the institutions that 
together comprise the state. Members of Congress, whose political careers 
depend on a steady influx of campaign funds, may be particularly susceptible 
to the demands of those with the resources to make large contributions. Civil 
servants in regulatory agencies, while certainly not immune to such political 
and financial pressures, may for structural reasons be less susceptible to them 
and periodically may take a more rigorous enforcement approach in the interests 
of economic stability. 

This account of the evolution of the thrift crisis suggests the need for a 
synthetic model of state action. As we have seen from the literature cited 
above, the state is capable of concerted action and rigorous regulation in the 
interest of financial stability, consistent with structural theory. On the other 
hand, as we have seen in the savings and loan crisis, the real-life political 
actors who comprise the state have their own political and career interests and 
are susceptible to a variety of external influences. Thus, while the state has a 
structural 1nterest in economic stability and therefore in containing collective 
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embezzlement, instrumental influences on state actors can—and periodically 
do—neutralize that structural interest and derail the regulatory agenda. 


DISCUSSION 


We have argued here that much of the insider fraud in the S&L industry in 
the 1980s constituted variations on “hot deals,’ “looting,” and “covering up,” 
and that some of these financial crimes can be thought of as collective embez- 
zlement. Distinct from the corporate crime in the manufacturing sector first 
documented by Edwin Sutherland (1983), this collective embezzlement is mo- 
tivated entirely by personal—not corporate—gain and erodes the institution’s 
financial health. 

We have also argued that the state’s relationship to such collective embez- 
zlement is quite different than to that of crimes perpetrated on behalf of the 
corporation. As a number of scholars have noted, the structurally based in- 
terest of the state in advancing corporate profits—or at least not interfering 
with them—helps explain the lenient treatment of corporate crime upon which 
the white-collar crime literature has generally focused. But to the extent that 
collective embezzlement undermines corporate profit-making and jeopardizes 
long-term economic stability, the structural restraints on enforcement do not ap- 
ply; indeed, structural imperatives would dictate a quick and rigorous response. 

However, as we have seen here, the US government response to the first 
signs of S&L fraud in the mid-1980s was conflicted and contributed to the 
crisis. Close examination of this response reveals that the savings and loan 
industry and its individual members were able to shield themselves temporarily 
through effective lobbying of key members of Congress and other officials. 
With regard to state theory, this pattern of government collusion in the crisis 
and intrastate struggle over regulation suggests the need for a more synthetic 
model, the beginnings of which we have sketched above. 

This analysis, however, also has policy implications. First, in contrast to 
the regulatory catch-22 surrounding the enforcement of social regulations, the 
state has an unequivocal interest (both long-term and short-term) in containing 
collective embezzlement. The recognition of this new form of fraud for what 
it is, and the understanding of its objective relationship to the state and eco- 
nomic stability, might open up new regulatory space for deterrence and rigorous 
sanctions. After all, the looting described above—and its relationship to the 
state—is different from bank robbery only in its magnitude and its destabilizing 
effect on the whole economy. As the former Commissioner of the California 
Department of Savings and Loans told a Congressional committee, '"The best 
way to rob a bank is to own one" (quoted 1n US Congress, House Committee 


on Government Operations 1988:34). 
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More broadly, this analysis suggests the need to revisit the role of monied 
interests in the political process, not just for the sake of political democracy but 
to shore up financial stability in this age of global economic transformation. 
For, while structural relations between the state and capital may allow——even 
dictate—a strict response to collective embezzlement, this response is subject to 
sabotage by those with the resources to woo policymakers with hefty campaign 
contributions. As one S&L regulator told us, 


“It was always the worst S&Ls ın America that were able to get dramatically more political 
intervention.... If you know you are engaged in fraud, what better return is there than a 
political contribution” (personal interview). 


In concluding, we urgently call for more research in this emerging area. While 
neither collective embezzlement nor its regulatory neglect are new, the epic 
proportions of these financial crimes and the increasing dominance of the 
finance capitalism on which they are based are unprecedented. Future re- 
search should build on the rich white-collar crime and state theory traditions 
to explore further the theoretical and policy implications of these important 
changes. 
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ABSTRACT 

The last decade has witnessed a plethora of macro studies of various forms of 
social control ranging from lynching to hospitalization. Unfortunately, these 
specific areas of research tend to be isolated from each other and do not constitute a 
recognizable literature. This paper shifts the focus of study from substantive 
forms of social control to theoretical issues that cut across them. One such issue 
is the relationship between forms of social control. First, the paper explicates this 
issue. Second, the paper reviews and critiques three specific research literatures 
on the relationships between various forms of social control that are isolated 
from each other although they bear on the same theoretical questions. Third, the 
paper argues that bivariate relationships between forms of social control are not 
meaningful theoretically in that they are not clearly derived from general theories 
of social control. Fourth, the paper argues that we should focus on the causal 
processes and structures that underlie the relationships between forms of social 
control and explicate their implications for these relationships. 


INTRODUCTION 


Although always an integral part of sociology, the study of social control has 
waxed and waned. Originally, the concept was defined broadly as any structure, 
process, relationship, or act that contributes to the social order; indeed, the 
concepts of social order and social control were indistinguishable (Meier 1982, 
Gibbs 1989). A consensus is now emerging that distinguishes social control 
from the social order it is meant to explain and that distinguishes among social 
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control processes. One basic distinction is between internal and external control 
processes. The former refers to a process whereby people adhere to social norms 
because they believe in them, feeling good, self-righteous, and proud when they 
do adhere to them and feeling bad, self-critical, and guilty when they do not. 
This process is sometimes termed “socialization.” External control refers to 
a social process whereby people conform to norms or rules because they are 
rewarded with status, prestige, money, and freedom when they do adhere to 
them and are punished with the loss of them when they do not. This process is 
sometimes termed coercive, external, or just social control. 

Only recently have researchers studied coercive social control as a dependent 
variable. These studies can be categorized as either micro or macro. Using 
individuals as the units of analysis, micro studies examine how various control 
activities such as arresting, prosecuting, and sentencing are affected by the 
legal, psychological, and social characteristics of people. These studies are 
reasonably well organized and synthesized, constituting a clearly defined body 
of literature. Using collectives, e.g. organizations and communities, as the 
units of analysis, macro studies examine how social control rates are affected 
by culture and social structure (Liska 1987, Horwitz 1990). Numerous studies 
examine the structure and functioning of the criminal justice system as the 
major social control bureaucracy, including studies of prison size and admission 
rates, prosecution rates, arrest rates, and police contact. Studies also examine 
the structure and functioning of the mental health system, focusing on factors 
that influence its size and social composition, as well as the structure and 
functioning of the social welfare system, focusing on factors that influence 
its size and scope, such as eligibility restrictions and payments. Additionally, 
studies examine various forms of collective action as social control, such as the 
historical emergence of the KKK, the historical pattern of lynchings, and the 
emergence of the Guardian Angels. 

Most of these studies focus on only one organization, policy, or program of 
either the criminal justice, mental health, or welfare systems. They are orga- 
nized or categorized by substantive forms of control (e.g. lynching, arrests, 
imprisonment, hospitalization, and welfare), rather than by theoretical propo- 
sitions. For the most part these topical literatures are isolated from each other. 
Although they sometimes address similar issues, they do not build on each other 
or even cite each other. Researchers studying imprisonment are criminologists 
interested in prisons; researchers studying lynching are specialists in race re- 
lations or collective behavior; researchers studying mental hospital admissions 
are interested in mental health; and researchers studying welfare are experts in 
social services. Because macro social control research is balkanized into iso- 
lated pockets around diverse forms of control, theoretical issues that cut across 
these forms are blurred, and the boundaries that define the study of macro social 
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control are diffuse and vague. Thus, although macro social control research has 
mushroomed, it has not contributed much to the development of macro social 
control theories over the last two decades. 

In this paper I attempt to develop theoretical and empirical bridges between 
macro forms of social control as one strategy for overcoming the present dif- 
fusion or balkanization of the field. I first review three small and isolated 
literatures that examine the empirical relationships between macro forms of 
social control. I conclude that empirical generalities across and even within 
these literatures are difficult to uncover. Second, I examine the theoretical sig- 
nificance of relationships between forms of control, that is, the extent to which 
the magnitude and direction of such relationships can be derived from general 
theories of control. I conclude that relationships between forms of control are 
not easily derived from general theories of control, making empirical studies 
of these relationships theoretically marginal. Third, drawing on these three 
empirical literatures and on general theoretical perspectives of control, I model 
the underlying processes of control and then derive their implications for the 
magnitude and direction of the relationships between forms of control. 


RELATIONSHIPS BETWEEN FORMS OF CONTROL 


I begin by examining the existing research literature on the relationships be- 
tween forms of control. It is organized into three distinct bodies of research 
more or less isolated from each other: the relationship between the criminal 
justice system (CJS) and the mental health system (MHS), such as between 
prison and mental hospital populations; the relationship between the criminal 
justice system and the welfare system (WS), such as between prison admission 
rates and welfare eligibility restrictions; and the relationship between levels of 
formal and informal forms of lethal control, such as between legal executions 
and lynchings. 


Criminal Justice And Mental Health Systems 

The relationship between the CJ and MH systems has been addressed through 
three research strategies: historical studies of the emergence of the two sys- 
tems, social indicator studies of the covariation between the populations and 
admission rates of the two systems, and survey studies of the prevalence of 
mental illness among the prison population and the prevalence of criminality 
among the mental asylum population. 


HISTORICAL STUDIES Social historians (Rothman 1971, Foucault 1965, 1978, 
Scull 1977, Ignatieff 1978, Garland 1985, Gillis 1989, Brown & Warner 1992) 
have examined the emergence of social control institutions in Europe and the 
United States during the seventeenth, eighteenth, and nineteenth centuries, a 
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period that saw the rise of contemporary social control institutions such as asy- 
lums, prisons, reformatories, orphanages, and almshouses. While agreeing that 
these institutions were part of a general social reform movement, social histo- 
rians disagree on the explanation. On one side of the debate, Rothman (1971) 
argues that as America shifted from an agricultural and a rural society to an 
industrial and an urban one, traditional forms of social control such as the fam- 
ily and religion ceased to be effective. In response, social reformers sought to 
build institutions that would instill the habits of discipline, obedience, and hard 
work of a bygone era. While these institutions may have achieved more control 
than reform, that was not the intention of the reformers. On the other side of the 
debate, Marxists (Ignatieff 1978, Scull 1977) embed the social reformers in the 
class structure of capitalism. Ignoring the reformer's proclaimed intentions, 
they explain the emergence of these control institutions as a concerted effort by 
elites to discipline the masses to the rhythms of capitalism. 

Whatever their disagreements, social historians tend to agree that contem- 
porary criminal justice and mental health systems emerged together as com- 
plementary responses by authorities and elites to the threat of urban disorder 
associated with the macro processes of urbanization, industrialization, and cap- 
italism. Yet, other than hypothesizing that asylums and prisons emerged during 
similar periods in response to similar social conditions, these studies do not 
explore how these control bureaucracies function together to control threaten- 
ing populations and actions. One exception is the recent work of Sutton (1990, 
1991). Studying social control during the Progressive era, he argues that be- 
cause of scarce resources, increases in some forms of control led to decreases 
in others. Specifically, increases in private institutions (more welfare oriented) 
for controlling youth led to decreases in public institutions (more coercive) 
for controlling them; decreases 1n almshouses led to increases in asylums; and 
increases in pensions led to decreases in asylums. 


SOCIALINDICATOR RESEARCH In the 1950s and 1960s considerable debate oc- 
curred on the medicalization of social problems, particularly crime. During the 
first half of the twentieth century, psychiatrists medicalized social problems, 
successfully arguing that the cause of many social problems, like crime, lies in 
the psychological malfunctioning of people and that the solution lies in the treat- 
ment of these people by medical specialists in treatment centers. Consequently, 
by the mid-1950s the US mental asylum population significantly increased to 
500,000 (Morrissey 1982), compared to the US prison population of less than 
200,000 (Bureau of Justice Statistics). However, from the mid-1960s these 
trends reversed. The asylum population significantly decreased and the prison 
population significantly increased, so that by the late 1970s the asylum popu- 
lation was down to 150,000 and the prison population was up to 300,000. The 
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asylum population has continued to decrease slightly and the prison popula- 
tion has continued to rapidly expand, approaching 1,000,000 by the mid-1990s. 
These trend reversals in both populations have stimulated questions of the extent 
to which prisons and asylums are functional social control alternatives. 

Using archival data, social indicators research addresses this question through 
an examination of the temporal and cross-sectional covariation between these 
two forms of control. Drawing on data from eighteen European and six non- 
European countries, Penrose (1939) originally suggested that social control 
systems evolve from physically controlling people (prisons) to medically treat- 
ing them (asylums), thereby suggesting an inverse relationship between prison 
and asylum populations. In what is perhaps the first rigorous study of this ques- 
tion, Biles & Mulligan (1973) report a negative correlation (—78) between the 
1968 US prison population and the number of hospital beds; but in a time se- 
ries of the United States from 1930 to 1970, Grabosky (1980) reports a positive 
correlation (.42) between the size of the prison and mental hospital populations. 

These studies have continued to yield inconsistent results, although they have 
become methodologically more sophisticated. Among the best are recent stud- 
ies by Inverarity & Grattet (1989) and Sutton (1991). Using a time series from 
1948 to 1985, Inverarity & Grattet report that the unemployment rate (taking 
the unemployed as a threatening population) affects the prison population and 
admission rates, the asylum population and admission rates, the welfare recip- 
ient rate, and the military personnel rate; but they report no evidence that these 
forms of control affect each other. On the other hanc, using a pooled cross- 
section of states from 1880 to 1923, Sutton (1991) reports that a decrease in the 
adult almshouse population leads to an increase in the asylum population. 


SURVEY RESEARCH Since the mid-1970s a third strategy has emerged for ex- 
amining the relationship between the criminal justice and mental health systems. 
Stimulated by the recent deinstitutionalization of asylums, this strategy exam- 
ines the mechanisms through which problematic populations move from one 
system to another (Steadman 1979, Arvanites 1992 1988, Cirincioe et al 1994). 
Because asylums today do not operate at full capacity, they provide space for 
overflows from other control bureaucracies, such as the criminal justice system. 
Various studies examine the extent to which threatening populations that in the 
past might have been admitted directly into asylums are now first processed 
into the criminal justice system. Rather than maintaining inmates with men- 
tal disabilities within an increasingly overcrowded criminal justice system, it 
seems likely that criminal justice authorities would be interested in transfer- 
ring them to other control systems, such as the mental health system. Thus, 
the criminal justice experience of the mentally ill should have increased after 
deinstitutionalization. 
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One set of studies examines the criminal justice experience (such as being 
arrested, imprisoned, and jailed) of people admitted into asylums before and 
after deinstitutionalization, from the late 1960s to the late 1970s. Studies of 
arrest seem to support the above conclusion. In one of the first studies, Melick 
et al (1979) report that in New York, patients with prior arrests increased from 
32% to 40% between 1968 and 1975. In a study of five states, Arvanites (1986) 
reports that patients with prior arrests increased from 29.8% to 46.6% between 
1968 and 1978. The data for imprisonment are more inconclusive. In one of the 
best studies of this period, Steadman et al (1984) report that for six states be- 
tween 1968 and 1978 prison inmates with prior mental hospitalization increased 
from 7.9% to 10.6%. However, reflecting the inconsistency in this literature, 
they also report that an increase occurred for only three of the states, but that it 
was large enough to outweigh the decrease for the other three states, yielding 
a net increase. If the mentally ill are over-arrested but not over-imprisoned, 
where are they going? Some researchers suggest that they are being ware- 
housed in city and county jails. Studies suggest that approximately 5% to 20% 
of jail inmates are psychotic (Teplin 1990, Palermo et al 1992, Arvanites 1992). 
While the evidence is far from conclusive, researchers suggest that when the 
criteria for admission to mental asylums were strengthened, practically closing 
the front door of asylums, authorities managed the less seriously mentally ill 
within local jails and transferred the more serious into mental asylums through 
the side door via legal mechanisms, such as incompetency to stand trial. 

In sum, over the years three bodies of research have emerged that address 
the relationships between the CJS and the MHS. Organized around disciplinary 
and research methods, these bodies of research coexist in splendid isolation 
from one another, although they address similar theoretical issues. Historical 
studies suggest that the criminal justice and mental health systems and many 
other forms of social control emerged together as complementary forms of 
control in response to conditions that threatened the seventeenth, eighteenth, 
and nineteenth century social order, particularly the position of ruling elites. 
Yet, over the years some forms have declined and others have expanded. Many 
scholars have come to study these forms, particularly the criminal justice and 
mental health systems, as functional alternatives. This work is organized by 
research methods: one examines the temporal and cross-sectional covariation 
between them, and the other examines the consequences of constricting one 
system (MHS) for the structure and functioning of the other (CJS). 

In the interest of theoretical integration, it is important to conceptualize 
these three research areas as different strategies that address different aspects 
of the same theoretical issue. All three strategies implicitly, if not explicitly, 
assume a model where social threat to the social order affects the criminal 
justice and the mental health systems. Three causal processes are assumed. 
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One, an increase in social threat leads to an increase in both the criminal justice 
and mental health systems. This is the causal process assumed by the social 
historians in their studies of the emergence of social control institutions during 
the seventeenth, eighteenth, and nineteenth centuries. Two, an increase in either 
the criminal justice or mental health system leads to a decrease in social threat, 
which in turn leads to a decrease in other forms of social control. This is the 
process implicitly assumed by the social indicator researchers in their studies 
of functional alternatives. Three, tightening the restrictions for admittance into 
one system (MH) decreases the effect of social threat on that system; this in turn 
increases the effect of social threat on other control systems (CJ) and increases 
transfers from the constricted system (MH) to the other system (CJ), which 
then controls the problem populations of both systems. This is the process 
assumed by the survey researchers in their studies of the social composition of 
the criminal justice and mental health systems and in their studies of transfers 
from the criminal justice system to the mental health system before and after 
deinstitutionalization of the latter system. 


Criminal Justice and Social Welfare Systems 


Welfare refers to those social arrangements that distribute benefits according to 
need. In modern societies the state is the major purveyor of these arrangements. 
There are at least three ways in which modern states distribute these benefits: 
occupational welfare, the redistribution of wealth through the nontaxation of 
occupational benefits, such as medical insurance; fiscal welfare, the redistri- 
bution of benefits through the taxation system; and public welfare, the direct 
redistribution of benefits. Discussion of welfare as social control is generally 
restricted to public welfare, particularly to those forms of public welfare (gen- 
eral assistance, aid to families with dependent children, and unemployment 
insurance) that appear to benefit the poor. 

Up to the economic crisis of the 1970s, growth in social welfare was viewed 
as a by-product of industrialization, and variation in social welfare was viewed 
as the outcome of policies to balance unemployment and inflation. Since the 
crisis of the 1970s, during which high inflation and high unemployment co- 
existed, social welfare growth and variation have been viewed as more prob- 
lematic. Viewing social welfare as primarily a governmental response to the 
social needy, one body of research examines the conditions by which the so- 
cial needy influence governmental policy (unionization, labor political parties), 
and another body of research examines the structure of the state itself (central- 
ized bureaucracy) as influencing government policy (Skocpal & Amenta 1985). 
Viewing social welfare as primarily a governmental response to control social 
disruption, particularly disruption detrimental to the interests of economic and 
political elites, another body of research (informed by conflict theory) examines 
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the conditions (mobilization of workers, unemployment rates, urban riots) that 
threaten the positions and interests of economic and political elites; this re- 
search views welfare as an effort to “buy off” the disadvantaged, committing 
them to an unequal social order. 

Piven & Cloward (1971) stimulated considerable controversy by arguing 
that the public welfare expansion, particularly of the AFDC program in the 
United States during the late 1960s and early 1970s, was a response to the 
urban riots during that period—an attempt to pacify an economically deprived 
and threatening population. Studies by Issac & Kelly (1981), Jennings (1983), 
Schram & Turbett (1983), and Chamlin (1989) provide various degrees of 
support for this thesis. 

While conceiving of some forms of welfare as a form of social control, 

, researchers have published very little on the relationship between welfare and 
other forms of social control. Various scholars (Box 1987, Box & Hale 1985) 
argue, however, that the relationship is negative. They point to cross-national 
comparisons that suggest that nations spending a lot on criminal justice spend 
very little on welfare and vice versa and to temporal trends that suggest that times 
of high criminal justice spending are times of low welfare spending and vice 
versa. For the most part these conclusions are based on only casual observations. 

I have been able to locate just a few systematic studies. Using a US time 
series, Inverarity & Grattet (1989) report little direct effect of welfare benefits 
(AFDC payments) on prison admissions, controlling for unemployment, crime 
rates, age composition, and prison releases. They suggest that these social 
control bureaucracies may function independently of each other, affected more 
by their own internal dynamics than by the dynamics of other social control 
bureaucracies. Inverarity & McCarthy (1988) argue that the issue may be more 
complex. They report that prison admission is more sensitive to unemployment 
rates in the competitive than in the monopolistic sectors of an economy. In the 
latter sector the CJS is less important for social control, and the unemployed 
are less threatening because they are more likely to be unemployed temporarily 
rather than permanently and they are likely to be eligible for welfare benefits. 
Coercive control is reserved for the more marginal workers in the competitive 
sector. This research, thus, clearly suggests an inverse relationship between the 
CJS and WS. When economic activity is monopolistic, welfare becomes the 
predominate form of control; whereas when it is competitive, coercion becomes 
the predominate form of control. 

Following up on this research, Colvin (1990) also suggests that certain forms 
of control may be more functional for certain modes of production and certain 
Jabor markets. The more monopolistic and primary the markets, the more likely 
welfare will be the form of control; the more competitive and secondary the 
markets, the more likely coercion (prison) will be the form of control. Using 
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a cross-section of US counties, Colvin reports that increases 1n the degree to 
which markets are primary increases welfare rates and benefits and decreases 
imprisonment rates, but that the degree to which markets are monopolistic does 
not predictably affect either welfare or imprisonment rates (also see Lessan 
1991, Schissel 1992). In sum, while there is considerable casual discussion of 
public welfare and imprisonment as alternative forms of social control, there has 
been very little systematic research. The research that has appeared provides no 
evidence that welfare and criminal justice systems directly or indirectly affect 
each other; instead, it suggests that the covariation between them depends on 
underlying causal conditions. If welfare rates are positively affected and im- 
prisonment rates are negatively affected by the same conditions, then welfare 
and imprisonment rates should negatively covary, not because they are func- 
tional alternatives, but because they are affected in opposite directions by the 
same social conditions. 


Legal Executions and Lynchings 

Recent years have witnessed a renewed research interest in lynching, partic- 
ularly in the relationship between lynchings and legal state executions, in the 
South from the post-Civil War era to the Great Depression (Reed 1972). Two 
general hypotheses have been advanced. Some researchers hypothesize that le- 
gal executions and lynchings are functional alternatives or substitutable forms 
of social control that satisfy people's sense for justice and revenge. As execu- 
tions increase, people's sense of justice and revenge is satisfied and lynchings 
decrease; as executions decrease, people's sense of justice and revenge is sat- 
isfied through various forms of self-help and vigilantism, including lynching 
(Phillips 1987). Other researchers argue that both executions and lynchings are 
attempts by ruling classes to manage the existing social order, and thus they 
hypothesize that legal executions and lynchings positively covary. As social 
threat, conflict, and tension increase, both lynchings and legal executions in- 
crease. While scholars have debated this issue for some time, researchers have 
recently addressed it empirically. 

Using a North Carolina time series from 1889 to 1918, Phillips (1986, 1987) 
stimulated interest in this issue. He reports that lynchings and legal executions 
were positively related from 1889 to about 1903 and have been unrelated after 
that time. He argues that both were political responses of white authorities to 
racial threat and conflict during the Reconstruction era. When the conflict and 
threat were reduced through the political disenfranchisement of blacks around 
the turn of the century, both lynchings and executions ceased to perform a 
political function; thus, they decreased and ceased to covary. 

Using an even more detailed and longer time series of both North Carolina 
and Georgia, Beck et al (1989) challenge these conclusions. For North Carolina 
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they report no relationship between lynchings and executions either before or 
after disenfranchisement, and for Georgia they report a positive relationship 
between them only before disenfranchisement, providing some support for 
the conflict perspective. The racial threat hypothesis also suggests that after 
disenfranchisement, which decreased the black political threat, the total level 
of lethal violence against blacks should have decreased. For this hypothesis, 
they find no support for North Carolina and some support for Georgia. Beck 
et al (1989) conclude that there is some support for the threat hypothesis and 
little support for the functional hypothesis. Indeed, they argue that executions 
and lynchings were not substitute responses to the same social threat. 

Massey & Myers (1989) examine the interrelationships between lynching, 
execution, and incarceration rates of black males in Georgia from 1882 to 1935. 
Upon controlling for drifts and trends in the three time series, they estimate 
cross-level correlations from 0 to 10 year lags between these three forms of 
social control, and they find just about no statistically significant correlations. 

In the most recent research, Tolnay & Beck (1995) examine the covaria- 
tion between legal executions and lynchings over time (1882-1930) and cross- 
sectionally for Southern states. While the time series suggests a negative effect 
of legal executions on lynchings, which is stronger in some periods than oth- 
ers, the overall effect is not statistically significant; while the cross-sectional 
analysis suggests a positive effect of executions on lynchings, which is stronger 
for some periods than others, the overall effect is not strong or statistically 
significant. (See related work by Soule 1992 and Olzak 1990.) 

In sum, there is little evidence supporting the functional hypothesis that 
lynching and legal executions negatively covary. Only some mixed evidence 
supports the conflict perspective that they positively covary and that the strength 
of that covariation depends on level of social threat. Perhaps as Massey & 
Meyers (1989) conclude, it is time to rethink the traditional assumption that 
forms of repressive social control are interdependent. 


EMPIRICAL COMMONALITIES 


I now briefly summarize the findings across the three substantive areas: criminal 
justice and mental health systems, criminal justice and welfare systems, and 
lynching and legal executions. 

There is little to no empirical support that these forms of social control neg- 
atively covary. National time series analyses of the relationships between im- 
prisonment and mental asylum admission rates and between imprisonment and 
welfare rates provide no support; state time series of the relationship between 
lynching and legal executions provide no support; and state cross-sectional anal- 
yses of the relationships between mental asylum and prison admission rates and 
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between prison admission and welfare rates provide no support. While some 
research does document the flow of people between the mental health and crim- 
inal justice systems, the number of people involved is too small to produce a 
noticeable effect on the overall relationship between these two systems. 

Some empirical evidence does however suggest that these forms of social 
control respond to the same causal conditions and thus positively covary. His- 
torical research indicates that forms of today’s criminal justice, mental health, 
and welfare systems emerged in about the same era in response to the same 
social conditions. Some time series research (Phillips 1986 & 1987, Beck 
et al 1989) suggests that lynchings and legal executions positively covary in re- 
sponse to the same social conditions (political threat and disenfranchisement); 
and a few time series and cross-sectional studies (Inverarity & Garattet 1989, 
Inverarity & McCarthy 1988, Colvin 1990) suggest that prison admission and 
welfare rates respond to similar causal conditions (unemployment in monop- 
olistic industries and competitive markets). Thus they covary to the extent to 
which these causal conditions covary. Yet, even these studies show only a weak 
positive covariation. 

If the relationships between forms of social control are weak, does this mean 
that these forms have little in common and might better be studied individually, 
and thus that the study of social control is nothing more than the aggregation 
of the study of specific forms of control? Indeed, Massey & Myers (1989) 
suggest that many forms of social control vary independently of each other, 
reflecting their internal organizational dynamics. Perhaps. Yet, we should not 
equate inconsistent and weak empirical relationships between forms of control 
with the lack of theoretical commonalities among them. Theoretically, com- 
monalities among forms of social control should refer not to the strength or 
sign of the empirical relationships between them, but to the causal processes 
and structures underlying them. Focusing on these common processes high- 
lights the theoretical commonalities among forms of control and, in principle, 
should allow us to theoretically derive the direction and magnitude of the re- 
lationships between them. Dependent on the complexity of such processes, 
positive or negative relationships and strong or weak relationships may be 
equally significant theoretically. I now turn to four major theoretical perspec- 
tives of macro social control for guidance on such common underlying social 
processes and their implications for the relationships between forms of social 
control. 


THEORETICAL PERSPECTIVES OF SOCIAL CONTROL 


I first review the two classic macro perspectives of social control coming out of 
the 1950s, 1960s, and 1970s (functionalism and conflict theory) and then two 


50 LISKA 


emerging perspectives (social movement theories and the new structuralism) 
coming out of 1980s. 

Conceiving of society as integrated and orderly, classic functionalism as- 
sumes that there is a general consensus on goals and values, that general needs 
for survival can be identified, that social structures function to maintain soci- 
ety’s values-goals-needs, and that social structures can be explained by these 
functions (Durkheim 1938). While “modern or neo” functionalism may not 
make many of these assumptions (Alexander 1985), much contemporary social 
control theory and research is guided by them. Social control is thought to 
maintain society’s values-goals-needs and indeed is thought to be explained by 
them. To the extent that the level and pattern of social control are effective— 
functional—they are assumed to persist and remain stable. These assumptions 
lead to hypotheses about social control and the relationship between forms of 
control. Assuming that social control remains stable at a level that is functional 
for a society, then as one form of control decreases for whatever reason, other 
forms will increase, or as one form satisfies society’s needs for a specific level 
of control, other forms will decrease (the functional equivalence, tradeoff, or 
hydraulic hypothesis). While the direction of the relationship between forms 
of social control seems clear (negative), the process whereby an increase in 
one form leads to a decrease in another is not so clear. Traditional versions of 
the theory couched in the language and logic of system needs and functional 
equivalence are extremely ambiguous about causal processes; indeed, there is 
even some reluctance to talk in these terms. On the other hand, some versions 
of the perspective (Stinchcomb 1968) try to capture traditional notions of func- 
tionality in terms of causal feedback loops. In such a model (Figure 1, Model 
A), 1t is argued that an increase in one form of control (SC1) is functional to 
the extent to which it reduces social threat (ST), which in turn decreases other 
forms of social control (SC2), yielding a negative relationship between them. 

Focusing on the fractures, conflicts, and competing interests within society, 
the conflict perspective assumes an uneven distribution of self interests in social 
control and of the power to implement them. It thus assumes that social control 
reflects the interests of the powerful (elites, authorities, and majorities) and 
that it is part of an overall strategy by them to manage the actions, events, and 
people that threaten their interests. Therefore, as the threat to their interest 
increases, the level and scope of social control increases (Blalock 1967, Turk 
1969, Spitzer 1975, Quinney 1977, Hawkins 1987, 1994, Liska 1992, 1994). 

Regarding the relationships between forms of social control, the conflict 
perspective is also ambiguous. Assuming that all forms of social control serve 
the interests of the powerful, on one hand it implies that all forms covary 
together; that is, an increase in the actions, events, and people that threaten the 
interests of the powerful leads to an increase in most, if not all, forms of social 
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control (Figure 1, Model B). On the other hand, it implies that if some forms 
of control are particularly successful at protecting the interests of the powerful, 
then other extant forms may decline and new forms may not emerge; that is, 
an increase in some forms of social control by decreasing threat may lead to 
a decrease in others, yielding a negative relationship between forms of social 
control, similar to the functional model (Figure 1, Model A). 

Loosely drawing on both conflict, symbolic interaction, and social construc- 
tionist theories, the more recent social movements literature conceives of society 
as fluid and changing. It focuses on the social processes through which social 
interests and cultural identities emerge and galvanize into social movements 
and through which social movements both define and change social institutions 
and organizations. In contrast to more traditional conflict theories, the empha- 
sis is less on threat to established interests than on the processes through which 
interests come to be defined, recognized, and mobilized. A vast body of studies 
examines the social processes through which social interests emerge to con- 
trol drugs, alcohol, mental illness, and delinquency (Gusfield 1963, Platt 1969, 
Hagen & Leon 1977, Sutton 1988, Warner & Brown 1992), though which these 
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interests galvanize into social movements, and through which these movements 
configure and reorganize the institutions and organizations of social control. A 
particular concern is the emergence of interests that galvanized into the Pro- 
gressive movement and the processes by which these interests reconfigured the 
social control of juveniles and the mentally ill. 

While this body of research examines various social movements that influ- 
ence multiple forms of social control, the underlying theory remains sketchy. 
Research emphasizes uncovering the specific social and historical contexts of 
specific movements, rather than extrapolating generalities from these contexts 
and movements, such as the relationships between forms of social control. 
Some movements are very specific, such as the anti-alcohol movement (Garfield 
1963), having few implications for other forms of control. Some are very gen- 
eral and far reaching, such as the Progressive movement, having implications 
for various control bureaucracies (e.g. delinquency, crime, mental health, and 
drugs) and thus for the relationships between them. With some exceptions (Sut- 
ton 1990, 1991), these relationships are not construed as problematic issues. 
Forms of social control are thought simply to emerge and covary to the extent to 
which emerging social interests find expression in them. Reflecting this focus, 
we substitute social interests (ST) for social threat in the model; reflecting the 
nature of the theory, we do not specify the signs of the causal paths from social 
interests to forms of social control (Figure 1, Model C). 

The 1980s also witnessed a renewed interest in enduring structured social 
relations (e.g. Blau 1977). As exemplified in the work of Black (1976, 1984, 
1993) and Horwitz (1990), structural theories embed the degree and form of 
social control in the structural characteristics of social relationships. Explana- 
tions of social control are couched in terms of these social relationships with 
minimal reference to human motives, values, and interests (or perceived threats 
to them). This perspective directs attention to different forms of control by ex- 
plicitly developing typologies for classifying them. Black (1976), for example, 
describes four forms of social control: penal, compensory, conciliatory, and 
therapeutic; and Horwitz (1990) further develops these and others. 

These forms of social control are embedded in general patterns of social 
relationships. Because many of these general patterns affect multiple forms of 
control, the perspective has implications for the relationships between them. As 
certain structural social relationships increase, some forms of social control in- 
crease and others decrease. For example, Black (1976) argues that as relational 
distance increases, remedial forms of social control (conciliatory and thera- 
peutic) increase and accusational forms (penal and compensatory) decrease; 
as stratification increases, conciliatory forms decrease; as upward law (upper 
status victimizes lower status) increases, therapeutic and compensation forms 
increase and penal forms decrease; and as downward law (low status victimizes 
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upper status) increases, therapeutic and compensation forms decrease and penal 
forms increase. 

While selected bivariate propositions have clear implications for the direc- 
tion of the relationships between forms of social control, the implications of the 
overall structure of the theory are not so clear (Greenberg 1983). For example, 
stratification increases penal, therapeutic, and compensative forms, implying 
a positive relationship between them; yet, stratification also increases vertical 
law (upward and downward law) relative to horizontal law (disputes among 
equals), which decreases therapeutic/compensative forms and increases puni- 
tive forms, thereby implying a negative relationship between them. In other 
words, the direct effect of stratification on penal and therapeutic/compensative 
forms implies a positive relationship between them, but the indirect effect of 
stratification on them through downward law implies a negative relationship 
between them (Figure 1, Model D). Because some bivariate propositions (e.g. 
the effects of stratification) imply a positive relationship between some forms of 
control and some bivariate positions (e.g. the effects of downward law) imply 
a negative relationship between these same forms, the implications of these bi- 
variate propositions taken together are problematic. The strength and direction 
of the relationship between SC1 and SC2 in Figure 1, Model D, can only be 
derived by fomulating the problem in multivariate terms and then by specifying 
theoretically the strength and direction of the relevant causal parameters or by 
estimating them empirically. 

In sum, each of the perspectives identifies commonalities in the casual pro- 
cesses that underlie macro forms of control. There is even some overlap and 
convergence among the perspectives as to the importance of social threat, al- 
though they differ on the sources and objects of that threat. Nonetheless, their 
assumptions and conceptualizations of the underlying processes (social threat or 
whatever) are ambiguous about its implications for the direction and magnitude 
of the relationships between forms of control. 

The functionalist perspective, with its concern for functional alternatives, 
suggests a negative relationship between forms of social control but is unclear 
about the causal process by which an increase in one form leads to a decrease 
in others. Some versions of the conflict perspective seem to suggest a posi- 
tive relationship, but others seem to suggest a negative relationship between 
forms of social control. The structural perspective, conceptualized as a series 
of bivariate propositions, suggests direct and indirect effects that sometimes 
imply inconsistent relationships between many forms of control. And the so- 
cial movement perspective does not seem to formulate general hypotheses that 
specify the relationships between forms of social control. Grounding its work 
in specific historical contexts, it implies that the relationships between forms 
of control depend on specific historical contexts. It seems that the general 
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theoretical perspectives of social control are much better at explaining specific 
forms of control than the relationships between them. This is one source of the 
balkanization problem. 


MODELING SOCIAL THREAT PROCESSES 


The problem in deriving the implications of these perspectives is that they are 
indeed very general perspectives. While focusing attention on some underly- 
ing processes, sometimes multiple processes and sometimes inconsistent ones, 
they are more suggestive than deductive. Hence, the theoretical meaning of 
the direction and magnitude of a relationship between forms of control is not 
always clear. Drawing on these perspectives, I now try to more explicitly model 
specific social control processes and to extrapolate their implications for the 
relationships between forms of control. My goal is not to develop the model. 
Rather, it is to argue that by explicitly modeling underlying social processes, 
we can derive their implications for the relationships between forms of con- 
trol; that these implications are not always simple but can be very sensitive to 
slight variations in these processes; and that by expressing these relationships 
in terms of underlying social processes, the relationships become theoretically 
meaningful. 

Ifocus on social threat processes because they are central to most perspectives 
of social control. While disagreeing on the sources and consequences of threat, 
they agree on its significance. The conflict perspective and to some extent the 
social movement tradition focus on the conditions that are threatening to some 
social groups and not others, reflecting assumptions about tensions and conflicts 
in society; and the functional and structural perspectives focus on conditions 
that are threatening to most people and groups (e.g. violent crime and war), 
reflecting assumptions about social consensus. 

At this point it is useful to distinguish and model the relationships between 
perceived social threat and its structural sources. Perhaps because perceived 
threat is so rarely measured in the macro literature, it is not distinguished from 
its structural sources. Both are used somewhat loosely and are interchangeable 
in the literature. Yet, there is no reason to assume that social conditions lead to 
social control unless they are perceived as threatening to economic and political 
elites, organizational authorities, or in some cases majorities. I thus explicate 
models in which perceived social threat (PST) mediates the effect of social 
structural sources of threat (SS) on forms of social control (SC) and in which 
the relationships between forms of control are expressed in terms of model 
processes. 

In linking the relationships between forms of control to underlying social 
processes (social threat), I draw on the logic of causal modeling. I distinguish 
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Model A 
: SC1(PS) 
SS(riot) —— PST 
SC2(AFDC) 


Model B NEF, EREI YCE 


p SC1(PS) 
SS(CR) —— PST ^ ' 


EN 


Figure 2 Threat models (SS, social structure; PST, percerved social control; PS, police size, 
AFDC, aid to families of dependent children; CR, crime rate, IR, imprisonment rate. 


SC2(IR) 


recursive from nonrecursive models. The former refers to models in which 
casuality is unidimensional. In terms of the models to follow, the sources of the 
covariation between forms of control are treated as exogenous variables: They 
affect forms of control but are not affected by them. Nonrecursive refers to mod- 
els that include feedback loops. In terms of the models to follow, SS, PST, and 
forms of SC are treated as endogenous variables that are part of feedback loops. 


Recursive Models 


In recursive models the implications of social processes for the variation among 
forms of control and for the covariation between them can be derived and 
expressed via the principles of path analysis. Thus if urban riots affect perceived 
social threat (PST), which in turn affects some agencies of the criminal justice 
system (police size) [as suggested by Jackson (1989)] and some programs of 
the social welfare system (AFDC) [as suggested by Pivan & Cloward (1971)], 
then we should expect these agencies and programs to be related. The exact 
magnitude and direction of the relationship can be derived from the principles 
of path analysis. For example, in Figure 2 Model A, the effect of SS (urban 
riots) on SC1 (police size) equals the effect of SS on PST times the effect of 
PST on SC1 (police size); the effect of SS on SC2 (AFDC) equals the effect of 
SS on PST times the effect of PST on SC2 (AFDC); and the correlation between 
SC1 (police size) and SC2 (AFDC) equals the effect of PST on SC1 times the 
effect of PST or SC2. If the effect of SS on PST equals .6, if the effect of PST 
on SC1 equals .5, and if the effect of PST on SC2 equals .4, then the effect of 
SS on SC1 equals .30 (.6 x .5), the effect of SS on SC2 equals .24 (.6 x .4), 
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and the model derived correlation between SC1 (police size) and SC2 (AFDC) 
equals .20 (.5 x .4). 


Nonrecursive Models 


To some degree, all of the theoretical perspectives assume, explicitly or im- 
plicitly, that some forms of social control affect perceived social threat either 
directly or indirectly by affecting structural sources of threat; yet, none of them 
elaborate the underlying causal processes or their implications for the relation- 
ships between forms of social control. 

In doing so, it is important to distinguish clearly the structural sources of 
perceived threat from perceived threat itself. Some forms of social control may 
affect both, some may affect one but not the other, and some may affect neither. 
It is equally important to distinguish the direction of the effect. Some forms of 
control may reduce perceived social threat, and some may increase it. Consider 
the size ofthe police force (per capita). We generally assume that police size, via 
processes of deterrence, decreases many structural sources of perceived social 
threat (e.g. crime rates), thereby reducing perceived social threat. Police may 
also directly affect perceptions of social threat. By 1ncreasing police visibility 
and control activities (e.g. arrests), police size may reduce perception of social 
threat; but by symbolizing and reminding people of the crime problem, police 
numbers may increase perceptions of social threat. Observing police peacefully 
walking their beat may be comforting, but observing them in action— constantly 
in action—may be frightening. The net effect of police activity on perceptions 
of social threat may thus depend on the relative strengths and time lags of these 
processes. 

The implications of such nonrecursive models for the relationships between 
forms of social control are very complex. I consider here just one relatively 
simple model. Model B in Figure 2 assumes just one structural source (crime 
rates) of perceived social threat (PST), which in turn positively affects two 
forms of control (police size and imprisonment rates), and one form (imprison- 
ment rates) in turn affects it. If the feedback effect is negative (imprisonment 
rates decrease PST), then increments to the variables in the loop oscillate and 
dampen; if the feedback effect is positive (imprisonment rates increase PST), 
then increments to the variables in loop escalate. If the absolute value of the 
loop coefficients is less than one, which is generally the case when the coeffi- 
cients are standardized, then the system equilibrates, allowing the calculation 
of the total effects of PST and SC1 (PS) on each other and of the covariance 
between SC1 (PS) and SC2 (IR) at equilibrium as well as at different stages in 
the equilibrium cycle. 

Consider the following example using Figure 2 Model B. If the PST effect 
(coefficient a) on SCi = .5 and the PST effect (coefficient c) on SC2 = 4, 
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and if there is no feedback effect, then the implied correlation between SC1 
and SC2 = .2(.5 x .4). If there is a positive feedback (coefficient b), then the 
implied correlation between SC1 and SC2 is larger than .2. The direct effects 
of PST on SC1 and SC2 are enhanced by the positive feedback loop, which 
is reflected in the correlation between SC1 and SC2. (See Hayduk 1987 for 
an intuitive discussion of enhanced effects.) If the feedback effect is .2, .3, 
.4, or .5, the implied correlation between SC1 and SC2 is .24, .29, .34, or .39, 
respectively. On the other hand, if the feedback effect is negative, then the 
implied correlation is smaller than .2. The total effects of PST on SC1 and SC2 
at equilibrium are less than the direct effects. If the feedback is —.2, —.3, —.4, 
or —.5, then the implied correlation between SC1 and SC2 is .09, .06, .03, or 0, 
respectively. Note that when the negative feedback effect (coefficient b) equals 
the positive threat effect (coefficient a), the implied correlations between PST 
and SC1 and between SCI and SC2 are zero; and when the negative feedback 
effect (coefficient b) is larger than the threat effect (coefficient a), the implied 
correlation between SC1 and SC2 is negative. 

This discussion is meant not to argue or demonstrate that any one of the mod- 
els constitutes the model of the underlying causal process, but to illustrate that 
the relationships between forms of control are easily derivable when the under- 
lying causal processes are explicitly modeled. The theoretical importance of 
these relationships lies not in their direction or magnitude, but in their derivabil- 
ity from a theoretical model of the underlying causal process. Depending on the 
model, the direction of the relationship may be either positive or negative, and its 
magnitude may be either small or large and be equally theoretically significant. 


SUMMARY 


Studies of social control are theoretically dissipating. Rather than being or- 
ganized around theoretical perspectives or even theoretical hypotheses, they 
are organized more and more around specific forms of control, such as prison 
admission rates; police size; mental hospital expenditures, release rates, and ad- 
missions rates; lynching rates; and AFDC growth rates. Researchers studying 
prison admissions are criminologists interested in prisons; researchers studying 
mental hospital admissions tend to be psychiatrists and psychologists interested 
in mental health; and researchers studying lynching tend to be sociologists and 
historicans interested in race relations. Hence, issues that cut across diverse 
forms of control are blurred, if not obscured. Studies of different forms of 
control that frame their work with similar theoretical hypotheses do not build 
on each other, do not cite each other, and frequently are not even aware of 
each other. While research has mushroomed, it has contributed little to the 
development of general social control theories. 
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One strategy to bridge the increasing balkanization of social control studies 
is to study the empirical relationships between forms of control. We have been 
able to locate three such mini literatures: the relationships between the crim- 
inal justice and mental health systems (generally prison and mental hospital 
admission and release rates), between the criminal justice and welfare systems 
(generally prison population and AFDC recipient rates), and between legal and 
illegal forms of control (generally state executions and lynch rates). Interest- 
ingly, while these literatures address similar issues, they, too, are isolated from 
each other, that 1s, they, too, do not build on each other or even cite each other. 

Some empirical generalities, however, can be observed across these three 
literatures: Most prominently, the empirical relationship between these macro 
forms is rarely negative; it tends to be positive although it is weak and not al- 
ways consistent across studies. What does this mean? Because the relationship 
is weak and mixed, it might be argued that specific forms of control follow their 
own internal dynamics, varying independently of each other form. Yet, weak 
empirical relationships between forms of control should not be equated with 
a lack of theoretical commonalities among them. Commonalities are found 
not in the empirical relationships between forms, but among the causal pro- 
cesses that underlie them. Dependent on the complexity of the casual process, 
the bivariate relationships between forms of control (the net outcome of these 
causal processes) can be positive or negative, weak or strong, and be equally 
theoretically significant. 

For guidance on the nature of these causal processes and their implications 
for the relationships between forms of control, I review four general theoretical 
perspectives of social control. Ironically, while these perspectives inform and 
frame many studies of specific forms of control, they are very ambiguous about 
therelationships between forms of social control. The functionalist perspective, 
while explicit about the direction of these relationships, is very vague about 
the causal processes that underlie them. The conflict perspective is vague 
about even the direction of the relationships. The social movement perspective 
suggests that the direction depends on specific but unspecified social contexts. 
And the structural perspective frequently leads to inconsistent predictions about 
the strength and even the direction of these relationships. 

Drawing on the three empirical literatures and the four theoretical perspec- 
tives, I formulate models of social threat processes that have implications for 
multiple forms of control and thus for the relationship between them. My con- 
cern is not to build the model of that process, but to develop plausible models 
and to explicate their implications for the relationships among forms of control. 
I distinguish between recursive and nonrecursive models because the principles 
(equations) for deriving their logical implications are different. The logical im- 
plications of recursive models are given by the principles of path analysis. While 
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these models yield relatively clear and simple implications for the relationships 
between forms of control, even these implications can be more complicated 
when perceived threat is disaggregated into types, such as economic, political, 
and criminal, and when the effect of perceived social threat on specific forms 
of control depends on other variables such as the structure of markets. The im- 
plications of nonrecursive models can be quite complex. While the principles 
for deriving them have been developed, sociologists have rarely used them. 

To reiterate, the theoretical significance of the relationships among forms of 
control lies not in their magnitude and direction but on being able to derive them 
from the causal processes that underlie them. Depending on the direction and 
magnitude of the underlying causal parameters, both recursive and nonrecursive 
models imply relationships among forms of control that can be either positive 
or negative and either weak or strong. 

In conclusion, studies of social control are theoretically dissipating, orga- 
nized more and more around specific forms of control. In an attempt to address 
this state of affairs, some research examines the relationship among forms of 
control. While this strategy is certainly useful, itis not always theoretically fruit- 
ful because the strength and even the direction of these relationships are not nec- 
essarily implied in more general theories of social control. Unless we are pre- 
pared to allow the study of social control to further dissipate, we need to shift the 
focus of study from specific forms of control to the social processes and struc- 
tures that both define general theories of social control and underlie multiple 
forms of control. We must look for theoretical commonalities not in the bivari- 
ate relationships among forms of control, but in the social processes and struc- 
tures that underlie them; and we must then express these bivariate relationships, 
whatever their direction and strength, in terms of these processes and structures. 
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ABSTRACT 

Since the 1980s, immigrant children and children of immigrant parentage have 
become the fastest growing and the most extraordinarily diverse segment of 
America’s child population. Until the recent past, however, scholarly attention 
has focused on adult immigrants to the neglect of their offspring, creating a pro- 
found gap between the strategic importance of the new second generation and the 
knowledge about its socioeconomic circumstances. The purpose of this article 
is to pull together existing studies that bear directly or indirectly on children's 
immigrant experiences and adaptational outcomes and to place these studies into 
a general framework that can facilitate a better understanding of the new second 
generation. The article first describes the changing trends in the contexts of the re- 
ception the new second generation has encountered. The article then discusses the 
ways in which conventional theoretical perspectives about immigrant adaptation 
are being challenged and alternative frameworks are being developed. Thirdly, it 
examines empirical findings from recent research and evaluates their contribution 
to the sociology of immigration. Finally, it highlights the main conclusions from 
prior research and their theoretical and practical implications for future studies. 


INTRODUCTION 


The phenomenal increase in contemporary immigration to the United States has 
given rise to a record number of children who, regardless of place of birth, are 
raised in immigrant families. Since the 1980s, a new generation of immigrant 
children and children of immigrants has become the fastest growing and the most 
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ethnically diverse segment of America’s child population. The 1990 US Census 
has revealed that about 15% of all children in the United States are immigrant 
children or children of immigrant parentage, and that 59% of Latino-American 
children and 90% of Asian American children are members of the first or second 
generation, compared to 6% of non-Latino African-American children and 5% 
of non-Latino European-American children (Landale & Oropesa 1995). 

Differing from their immigrant parents, immigrant children and children of 
immigrants lack meaningful connections to their “old” world. They are thus 
unlikely to consider a foreign country as a place to return to or as a point of 
reference. They instead are prone to evaluate themselves or to be evaluated by 
others by the standards of their new country (Gans 1992, Portes 1995). Given 
the fact that children of contemporary immigrants will represent a crucial com- 
ponent of future American society, how are we to understand these children’s 
adaptation to their role as citizens and full participants in American society? 
How do migration processes, contexts of reception, and biculturalism impact 
the process of becoming American? Has assimilation continued to lead to up- 
ward social mobility? Has the younger generation of today’s immigrants been 
able to assimilate into American society, following the path taken by the “old” 
second generation arriving at the turn of the century and advancing beyond their 
parents’ generation? 

Until the recent past, scholarly attention has focused on adult immigrants to 
the neglect of child immigrants and immigrant offspring, creating a profound 
gap between the strategic importance of these children and the knowledge about 
their conditions. The purpose of this article is to pull together existing stud- 
ies that bear directly or indirectly on immigrant experiences and adaptational 
outcomes of the children of contemporary immigrants and to place these stud- 
ies into a general framework that can facilitate a better understanding of these 
children's socioeconomic circumstances and life chances. In so doing, I first 
describe the changing trends in the contexts of the reception that the new sec- 
ond generation has encountered; I then discuss the ways in which conventional 
theoretical perspectives about immigrant adaptation are being challenged and 
alternative frameworks are being developed. Thirdly, I examine empirical find- 
ings from recent research and evaluate their contribution to the sociology of 
immigration. Finally, I highlight the main conclusions from prior research and 
their theoretical and practical implications for future studies. 


THE RISE OF THE NEW SECOND GENERATION 
Who Makes Up the New Second Generation? 


The new second generation technically refers to the children of contemporary 
immigrants. The emerging literature on the new second generation, however, 
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has discussed not only US-born children—the true second generation—but also 
contemporary immigrant children who have arrived in the United States before 
they reach adulthood (Gans 1992, Portes 1996). The latter group is also known 
as the “one-and-a-half generation," a term coined by Rubén Rumbaut to char- 
acterize the children who straddle the old and the new worlds but are fully part 
of neither (Pérez Firmat 1994, Rumbaut 1991). Usage of these generational 
terms has not been consistent (Oropesa & Landale, forthcoming). Depend- 
ing on social and historical processes of immigration and particular nationality 
groups under study, the second generation is sometimes broadened to include 
foreign-born children arriving at pre-school age (0-4 years) because they share 
many linguistic, cultural, and developmental experiences similar to those of 
immigrant offspring (Zhou & Bankston, forthcoming). The one-and-a-half 
generation, on the other hand, is sometimes broken down into two distinct co- 
horts: children between 6 and 13 years of age as 1.5-generation children and 
those arriving as adolescents (aged 13 to 17) who are similar to first-generation 
children. Although scholars may vary in their ways of defining the new sec- 
ond generation, they have generally agreed that there are important differences 
between children of different cohorts of the one-and-a-half and second gener- 
ation, particularly in their physical and psychological developmental stages, in 
their socialization processes in the family, the school, and the society at large, 
as well as in their orientation toward their homeland. 

The main characteristics of the new second generation mirror those of con- 
temporary immigrants, which are extraordinarily diverse in national origins, 
socioeconomic circumstances, and settlement patterns. Always the defining 
character of America, however, the composition and meaning of diversity today 
have changed significantly since the turn of the century. Most notably, con- 
temporary immigrants have been predominantly non-Europeans. According 
to the Immigration and Nationalization Service, of the 7.3 million immigrants 
admitted to the United States during the 1980s (not counting undocumented 
immigrants), 87% came from Asia and the Americas, compared to the 8.8 mil- 
lion admitted during the 1910s who were predominantly from Europe. In the 
past decade, Mexico, the Philippines, China/Taiwan, South Korea, and Vietnam 
were the top five sending countries, followed by the Dominican Republic, India, 
El Salvador, and Jamaica. Immigrants from Mexico alone accounted for more 
than one fifth of total legal admissions as well as half of illegal immigrants. 

The diversity in national origin has, accordingly, become a salient feature 
of the new second generation. The 1990 Census has shown that the foreign- 
born child population under 18 years of age is made up of 5296 Latinos 
(5396 of whom are Mexicans) and 2796 Asians, and that the US-born co- 
hort with at least one immigrant parent is made up of 48% Latinos and 24% 
Asians (Oropesa & Landale, forthcoming). More strikingly, about one out of 
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three foreign-born children has at least one Mexican-born parent (Perlmann & 
Waldinger 1996). 

Also different from the turn-of-the-century inflows, contemporary immi- 
grants have come from diverse socioeconomic backgrounds. The 1990 Census 
has attested to the vast differences in levels of education, occupation, and income 
by national origins. For example, over 6096 of immigrants (aged 25 years or 
older) from India and Taiwan report having attained college degrees, three times 
the proportion of average Americans, but less than 596 of those from Cambodia, 
Laos, El Salvador, and Mexico so report. Among the employed workers (aged 
16 years or older), over 4596 of immigrants from India and Taiwan have man- 
agerial or professional occupations, more than twice the proportion of average 
American workers, but less than 796 of those from EI Salvador, Guatemala, and 
Mexico so report. Further, immigrants from India report a median household 
income of $53,000, compared to $30,000 for average American households; 
those from Cambodia, the Dominican Republic, El Salvador, Honduras, Laos, 
and Mexico report a median household income below $22,000. 

The socioeconomic diversity suggests that the pathways to social mobility 
will not be a straight line nor unidirectional. While many immigrants continue 
to follow the traditional bottom-up route, significant numbers of new arrivals 
have bypassed the bottom starting line and moved directly into mainstream 
labor markets while dispersing into suburban middle-class communities. The 
implications for the new second generation are profound, since the current state 
and future prospects of immigrant children are related to the advantages or 
disadvantages that accrue to the socioeconomic status of their parents. 

A third salient feature of contemporary immigrants is their geographic con- 
centration. Unlike earlier European immigrants whose destinations were in 
the Northeast, contemporary immigrants have been disproportionately concen- 
trated in the West. California accounted for over a third of the total arrivals of 
legal immigrants in the 1980s, while New York, the traditional largest receiving 
state, accounted for only 14% (Portes & Zhou 1995). The impact of geographic 
concentration is also felt in public schools in high immigration states. Again, 
California alone accounted for some 45% of the nation’s immigrant student 
population, more than one out of ten school-aged children in the state were for- 
eign born, and over a third of the state’s school-aged children spoke a language 
other than English at home (Cornelius 1995). 


The Changing Contexts of Reception 

Differences in national origins, socioeconomic backgrounds, and geographic 
patterns of settlement are important factors for immigrant adaptation. However, 
adaptational outcomes are also determined by structural conditions in the host 
society (Portes & Rumbaut 1996). The context of the reception that has greeted 
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contemporary immigrants and their children has changed dramatically over the 
past three decades to create additional obstacles to "melt" the diverse body of 
immigrants and their offspring into a single mainstream. 

Unlike earlier European immigrants, contemporary immigrants have been 
received in a peculiar circumstance: an emerging “hourglass” economy in which 
opportunities for social mobility shrink even among native-born Americans and 
a welfare state that is highly contested by the general public. Several major 
trends are especially unfavorable for the adaptation of the nation's newcomers 
and their children. 

First, the gap between rich and poor, which progressively narrowed for most 
of the twentieth century, has been widening in recent years as it has been affected 
by globalization and economic restructuring. Only a portion of the American 
work force has seen its economic advantages steadily increase as information 
technology and management become more critical to the economy; most have 
experienced worsening conditions. During the 1980s, 8096 of American work- 
ers saw their real hourly wages go down by an average of about 596 (Mishel 
& Bernstein 1992). Blue collar jobs, the kinds of jobs generally available to 
newly arrived immigrants, not only pay less than in previous years, but they are 
also disappearing at a particularly rapid rate, resulting in expanding classes of 
poor and rich and a shrinking middle class. In such an economic structure, even 
US-born Americans find their chances for economic mobility lessening. The 
situation for many immigrants is bleaker, except for the unusually fortunate, 
the highly educated, and the highly skilled (Waldinger 1996). 

Contemporary economic hardships are different from the hardships of the 
Great Depression and hardships in many Third World countries. Although 
there is a growing class of poor Americans, there are relatively few deaths from 
starvation in the United States. Until the early 1990s, the welfare state had made 
access to public assistance relatively easy (Rumbaut 1994a, Tienda & Liang 
1994). While opportunities for stable jobs with good incomes were rare for low- 
income individuals, food stamps and Aid to Families with Dependent Children 
were readily available. Public assistance did not provide a comfortable way of 
life, for welfare payments averaged less than half the amount defined as poverty 
level income (Sancton 1992:45); it did, nonetheless, provide a means of exis- 
tence for the chronically poor, unemployed or underemployed. Yet, members of 
this expanding class of poor were not being offered chances for socioeconomic 
improvement; they were, for the most part, being fed and housed and main- 
tained in their social and economic limbo. Such unfortunate circumstances 
were exacerbated just prior to the 1996 presidential election when President 
Bill Clinton signed a Republican welfare reform bill. The bill, which limits 
public assistance to two continuous years and mandates a five-year lifetime 
maximum, with neither public jobs nor childcare for recipients who exceed the 
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limit and nothing for their children. This has changed the nature of the welfare 
state in new and significant ways: It cuts off the lifeline of the poor, especially 
children, driving them into deeper poverty; it also excludes legal immigrants 
from much access to basic forms of assistance, forcing poor immigrant families 
to swim or sink. Long-term effects of the welfare bill remain to be seen, but 
certainly millions of children will be thrown into poverty, and chances for the 
truly disadvantaged to get out of poverty will be even bleaker. 

Second, poverty has been highly concentrated. The poor are not, of course, 
being housed evenly across the American landscape. Even before new infor- 
mation technologies and the globalization of production, the contraction of 
American manufacturing and the suburbanization first of the middle class pop- 
ulation and later of middle class jobs have displaced the American working 
class. The disappearance of industrial jobs in urban areas where racial minori- 
ties concentrate has detached the middle class from the working poor, causing 
a high concentration of poverty in the most disadvantaged segments of the 
minority population in inner-city ghettos (Wilson 1987). 

The problem of poverty concentration has worsened under large-scale eco- 
nomic restructuring, which has reduced the demand for low-skilled and semi- 
skilled immigrants and trapped them in unemployment and social isolation sim- 
ilar to that commonly facing native-born minorities in the most impoverished 
stratum of the society. The implication for members of younger generations 
is profound. Immigrant children from middle-class backgrounds benefit from 
financially secure families, good schools, safe neighborhoods, and other sup- 
portive formal and informal organizations, which ensure better life chances 
for them. Children with poorly educated and unskilled parents, in contrast, 
often find themselves growing up in underprivileged neighborhoods subject to 
poverty, poor schools, violence and drugs, and a generally disruptive social 
environment. Many immigrant children attend public schools in their neigh- 
borhood with a clear numerical majority of minority students. In Los Angeles 
County, for example, 57 unified school districts out of a total of 83 contain over 
half of nonwhite students, and 34 have more than three quarters of US-born 
minority and other immigrant students. In major immigrant-receiving cities 
such as Los Angeles, San Francisco, New York, and Miami, at least a third of 
the students in the entire school system speak a language other than English at 
home. 

Third, there has been a drastic increase in the proportion of American children 
in one-parent families. Hernandez (1993) observed that between 1939 and 1988 
the proportion of the officially poor children who lived in mother-only families 
increased from 10% to 57%, and that such an increase was counterbalanced 
by a virtually identical decline in the proportion living in two-parent families. 
The rise of single-parent families has aggravated the overall poverty trends for 
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children. Relative to children living in intact families, the children living in 
one-parent or even blended families tend to be disadvantaged with regard to 
socioeconomic circumstances, psychological function, behavioral problems, 
education, and health and these conditions severely limit their life chances 
(Hernandez 1993). 

Unfortunately, such disadvantages in family situations have worsened for 
poor immigrant families who have lived a longer time in the United States. 
Landale & Oropesa (1995) found significant increases of children living in 
single-parent families across generations of US residence and across many 
Asian and Latin American nationality groups. By the third generation, in par- 
ticular, the prevalence of female headship among all nationality groups of Latin 
American children (ranging from 40% of Mexicans, 50% of Cubans, to 70% of 
Dominicans) and Filipino children (40%) constituted a serious disadvantage. 
This situation implies that even if the parental generation is able to work hard 
to achieve higher positions and higher incomes, their children’s access to these 
gains may be seriously circumvented by family disruption (Landale & Oropesa 
1995). 

Fourth, there has been a growing “oppositional culture” among young 
Americans, especially among those who have felt oppressed and excluded from 
the American mainstream and who have been frustrated by the widening gap 
between a culture that highly values freedom and materialism and the reality of a 
dwindling economic future. Many of these American children have responded 
to their social isolation and their constrained opportunities with resentment 
toward middle-class America, rebellion against all forms of authority, and re- 
jection of the goals of achievement and upward mobility. Because students in 
schools shape one another's attitudes and expectations, such an oppositional 
culture negatively affects educational outcomes. School achievement is seen 
as unlikely to lead to upward mobility, and high achievers are seen as sell-outs 
to oppressive authority. Matute-Bianchi (1986, 1991) found that the relation- 
ship between scholastic achievement and ethnicity did not hold for native-born 
Chicanos and Cholos, who had been uprooted from their Mexican heritage and 
were trapped in a caste-like minority status. They reacted to their exclusion and 
subordination with resentment, regarded efforts toward academic achievement 
as "acting white," and constructed an identity in resistance to the dominant ma- 
jority white society. Suárez-Orozco (1991) reached similar conclusions about 
native-born Mexican Americans, who perceived the effect of the educational 
system as continued exploitation. 

While there is a strong antiintellectual streak in American youth culture at 
all socioeconomic levels, the rejection of academic pursuits is especially in- 
tense among members of minority groups, who are more likely than members 
of the majority to identify school administrations with oppressive authority, to 
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perceive their entry into the middle class as almost impossible, and to be in 
schools where learning is strongly discouraged by peers. It would be wise to 
avoid passing judgment on the children in these schools and to avoid blaming 
them for their responses to the world around them. Merton long ago (1938) 
described rebellion as an adaptive response to a gap between socially approved 
goals and available means of achieving those goals. Lowered chances for mo- 
bility create frustration and pessimism for all American young people, but 
these emotions are most strongly felt by those at the bottom. When those at 
the bottom are also members of historically oppressed minority groups, the 
frustration is mixed with the need to maintain self-esteem, so that rejection of 
middle class mores and opposition to authority become important strategies for 
psychological survival. 

In underprivileged neighborhoods, in particular, immigrant children meet in 
their schools native-born peers with little hope for the future and are thus likely 
to be pressured by their peers to resist assimilation into the middle class as 
expected by their parents. These trends pose a challenge to all parents, but the 
challenge is especially daunting for immigrant parents with limited educational 
backgrounds, frequently limited English skills, and few resources. 


THEORETICAL DEVELOPMENTS IN RESEARCH 
ON IMMIGRANT ADAPTATION 


The Assimilation Perspective Revisited 


In the literature on immigrant adaptation, the assimilation perspective has dom- 
inated much of the sociological thinking on the subject for the most part of this 
century. Central to this perspective is the assumption that there is a natural 
process by which diverse ethnic groups come to share a common culture and to 
gain equal access to the opportunity structure of society; that this process con- 
sists of gradually deserting old cultural and behavioral patterns in favor of new 
ones; and that, once set in motion, this process moves inevitably and irreversibly 
toward assimilation (Park 1928, Warner & Srole 1945, Wirth [1925] 1956). 
While earlier assimilation theorists emphasized forces such as time, indus- 
trialization, and acculturation, Gordon (1964) conceptualized several types 
of assimilation: cultural or behavioral, structural, identificational, attitude- 
receptional, behavior-receptional, and civic assimilation. In Gordon’s view, 
immigrants began their adaptation to the new country through cultural assimi- 
lation, or acculturation. Since cultural assimilation was for Gordon a necessary 
first step, ıt was considered the top priority on the agenda of immigrant ad- 
justment. Gordon implied that acculturation would take place and continue 
indefinitely even when no other type of assimilation occurred (1964, p. 77). 
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But he argued that acculturation did not necessarily lead to other forms of inte- 
gration into the host society (i.e. large-scale entrance into the institutions of the 
host society or intermarriage). Ethnic groups would remain distinguished from 
one another depending largely on the degree to which groups gained the accep- 
tance of the dominant population (p. 159). Gordon anticipated, nevertheless, 
that ethnic minorities would eventually lose all their distinctive characteristics 
and cease to exist as ethnic groups as they pass through the stages of assimi- 
lation, eventually intermarrying with the majority population and entering its 
institutions on a primary-group level (pp. 70-71). 

From the point of view of assimilation theorists, distinctive ethnic traits 
such as old cultural ways, native languages, or ethnic enclaves are sources of 
disadvantages that negatively affect assimilation (Child 1943, Warner & Srole 
1945). Although complete acculturation to the dominant American culture may 
notensure all immigrants full social participation in the host society, immigrants 
must free themselves from their old cultures in order to begin rising up from 
marginal positions. Between the 1920s and the 1950s, America seemed to have 
absorbed the great waves of immigrants, who arrived primarily from Europe. 
Sociological studies have indicated progressive trends of social mobility across 
generations of immigrants and increasing rates of intermarriages, as determined 
by educational attainment, job skills, length of stay since immigration, English 
proficiency, and levels of exposure to American cultures (Alba 1985, Chiswick 
1977, Greeley 1976, Sandberg 1974, Wytrwal 1961). 


Observed Anomalies 

Beginning in the 1960s, the conventional assimilation perspective with its ap- 
plication to the more recently arrived non-European immigrant groups has 
been met with challenges. Instead of eventual convergence in the outcomes 
of immigrant adaptation, several anomalies have been observed in recent re- 
search. The first anomaly concerns the persistent ethnic differences across gen- 
erations. Studies on intergenerational mobility have found divergent rather 
than convergent outcomes, revealing that early and insignificant differentials in 
advantage result in substantial differences in educational and occupational mo- 
bility in later years (Becker 1963, Goffman 1963). In their study of educational 
attainment of 25 religio-ethnic groups in the United States, Hirschman & Falcon 
(1985) have found that neither generation nor length of US residence signifi- 
cantly influences educational outcomes. Specifically, children of highly edu- 
cated immigrants consistently fare much better 1n school than do fourth- or fifth- 
generation descendants of poorly educated ancestors regardless of religio-ethnic 
backgrounds. In a study of the Irish, Italian, Jewish, and African Americans 
in Providence, Rhode Island, Perlmann (1988) found that, even with family 
background factors held constant, ethnic differences in levels of schooling and 
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economic attainment persisted in the second and later generations and that 
schooling did not equally commensurate with occupational advancement for 
African Americans as for other European-Americans across generations. 

Another anomaly is what Gans (1992) describes as the “the second gen- 
eration decline.” Gans notes three possible scenarios for today’s new second 
generation: education-driven mobility, succession-driven mobility, and niche 
improvement. He observes that immigrant children from less fortunate so- 
cioeconomic backgrounds have a much harder time than other middle-class 
children to succeed in school, and that a significant number of the children of 
poor, especially dark-skinned, immigrants can be trapped in permanent poverty 
in the era of stagnant economic growth and in the process of Americanization 
because these immigrant children "will either not be asked, or will be reluctant, 
to work at immigrant wages and hours as their parents did but will lack job 
opportunities, skills and connections to do better" (pp. 173—74). Gans predicts 
that the prospects facing children of the less fortunate may be high rates of 
unemployment, crime, alcoholism, drug use, and other pathologies associated 
with poverty and the frustration of rising expectation (p. 183). Perlmann & 
Waldinger (1996) call this phenomenon “the second generation revolt.” They 
argue that such revolt is not merely caused by exogenous factors, such as racial 
discrimination, declining economic opportunities, and the exposure to the ad- 
versarial outlooks of native-born youths, but also by endogenous factors inher- 
ent in the immigration process, including pre-immigration class standing and 
the size and the nature of immigrant inflows. 

Still another anomaly 1s the peculiar outcomes of immigrant adaptation. To- 
day, neither valedictorians nor delinquents are atypical among immigrant chil- 
dren regardless of timing and racial or socioeconomic backgrounds. For exam- 
ple, in the past 15 years, the list of top-ten award winners of the Westinghouse 
Science Talent Search, one of the country's most prestigious high school aca- 
demic contests, has been dominated by the 1.5- or second-generation immi- 
grants. Many of these immigrant children are "FOBs" (fresh off the boat) and 
from families of moderate socioeconomic backgrounds (Zhou 1997). While 
immigrant children are overrepresented on lists of award-winners or on aca- 
demic fast tracks, many others are extremely vulnerable to multiple high-risk 
behaviors, school failure, street gangs, and youth crime. Even Asian Americans, 
the so-called “model minority,” have seen a steady rise of youth gang mem- 
berships. Some of the Asian gang members are from suburban middle-class 
families, attend magnet schools, and are exceptionally good students. These 
anomalies immediately question the applicability of straight-line assimilation. 


Alternative Theoretical Frameworks 


A major alternative framework is the pluralist perspective, which perceives 
American society as composed of a collection of ethnic and racial minority 
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groups, as well as the dominant majority group of European Americans. It is 
concerned with a fundamental question of how the world would look differ- 
ent were the experiences of the excluded placed at the center of our thinking 
and with the ways in which immigrants actively shape their own lives rather 
than exist passively as beneficiaries or victims of "ineluctable modernizing and 
Americanizing forces" (Conzen 1991, p. 10). A central idea is that ethnicity 
can serve as an asset rather than a liability. This idea provides a means of 
understanding how ethnicity may be utilized as a distinct form of social capital 
(including such cultural endowments as obligations and expectations, infor- 
mation channels, and social norms) that contributes to adaptation (Glazer & 
Moynihan 1970, Handlin 1973). 

From a pluralist point of view, preimmigration cultural attributes inherent to 
ethnicity are not necessarily absorbed by the core culture of the host society, 
but they constantly interact with it. Greeley contended that "ethnicity is not a 
way of looking back to the old world ... [but] rather a way of being Ameri- 
can, a way of defining yourself into the pluralist culture which existed before 
you arrived" (1976, p. 32). Conzen and her associates conceptualized ethnic- 
ity as “a process of construction or invention which incorporates, adapts, and 
amplifies preexisting communal solidarities, cultural attributes, and historical 
memories," grounded in real-life context and social experience (Conzen et al 
1992, pp. 4—5). According to these scholars, preimmigration cultural attributes 
cannot be equated with homeland cultures, because immigrants tend to select 
carefully not only what to pack in their trunks to bring to America, but also 
what to unpack once settled. Also, homeland cultural norms and values may 
not be entirely inconsistent with those of the host country. Just as some aspects 
of immigrant cultural patterns may continue in a state of uneasy coexistence 
with the requirements of the host country, other aspects of immigrant cultural 
patterns may "fit" the requirements of life there or may even be prerequisites for 
"making it in America” (Fukuyama 1993). Still others are modified, changed, 
adapted, transformed, reformed, and negotiated in the course of immigrant 
adjustments (Garcia 1996). 

The pluralist perspective offers an alternative way of viewing the host society, 
treating members of ethnic minority groups as a part of the American popula- 
tion rather than as foreigners or outsiders and presenting ethnic or immigrant 
cultures as integral segments of American society. However, the questions of 
"second-generation decline" and “second-generation revolt" have been unan- 
swered within this theoretical framework. While how people construct or invent 
their own ethnicity has been emphasized, how they also construct their own ac- 
culturation and assimilation has been understudied. Gans (1992) points out that 
pressures of both formal acculturation (through schooling) and informal accul- 
turation (through American peers and the media) will impinge on the second 
generation. He reasons that immigrant children may be so overwhelmed by a 
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youth culture and the freedoms (particularly personal choices in dress, dating, 
sexual practices) unavailable in their old country that, because of the sheer at- 
tractiveness of American culture, they may not be willing to accept immigrant 
parental work norms or to work in “un-American” conditions as many of their 
parents do, and that they may be unwilling to endure their parents’ painstaking 
efforts toward upward social mobility. Perlmann & Waldinger (1996), how- 
ever, caution that the deteriorating prospect in the new second generation is not 
simply a matter of exposure or Americanization but also a result of structural, 
group, or individual disadvantages associated with pre- and post-immigration 
experiences. 

Moreover, the elusiveness of ethnic characteristics creates problems in the 
use of the pluralist framework as an explanatory tool. Each generation passes 
cultural patterns, often subtle patterns, to the next, but the mechanisms of this 
process are unclear, and many assumptions and attitudes of ethnic group mem- 
bers are hard to identify and measure (Archdeacon 1983). Also, the constituents 
of American diversity are not equal; maintaining a distinctive ethnicity can 
both help and hinder the social mobility of ethnic group members. For exam- 
ple, first-generation members of some immigrant minority groups, such as the 
Mexicans, have seldom been able to motivate their children to excel in school 
and move upward in the host society, while other groups, such as the Asians, 
have far more often succeeded in pushing younger people toward upward social 
mobility (Perlmann & Waldinger 1996). 

Another major theoretical stance is the structural perspective, which offers a 
framework for understanding the differences in social adaptation of ethnic mi- 
nority groups in terms of advantages and disadvantages inherent to social struc- 
tures rather than in the process of acculturation or selective Americanization. 
This perspective presents American society as a stratified system of social in- 
equality, in which different social categories—whether birth-ascribed or not— 
have unequal access to wealth, power, and privilege (Barth & Noel 1972). The 
ethnic hierarchy systematically limits access to social resources, such as oppor- 
tunities for jobs, housing, and education, resulting in persistent racial/ethnic dis- 
parities in levels of income, educational attainment, and occupational achieve- 
ment (Wilson 1987). Consequently, the benefits of "becoming American" de- 
pend largely on what stratum of American society absorbs the new immigrants. 

Overall, the structural perspective raises skepticism about eventual assimila- 
tion and interethnic accommodation suggested by the assimilation perspective 
and implied by the pluralist perspective, because of inherent conflicts between 
the dominant and subordinate groups in the hierarchy. On the issue of immigrant 
adaptation, this perspective maintains that the process of becoming American 
may not lead uniformly to middle-class status, but rather to the occupation of 
different rungs on the ethnic hierarchy. 
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The segmented assimilation thesis provides another framework for examin- 
ing the divergent outcomes of today’s new second generation. Portes & Zhou 
(1993) observe that assimilation has continued to serve as a norm of immigrant 
adaptation, but that its outcomes have become segmented: Either confinement 
to permanent underclass memberships or rapid economic advancement with 
deliberate preservation of the immigrant community’s values and solidarity 
is equally possible for the new second generation. This segmented assimila- 
tion thesis recognizes the fact that today’s immigrants are received in various 
segments of American society ranging from affluent middle-class suburbs to 
impoverished inner-city ghettos. Such contextual differences mean that paths 
to social mobility may lead to upward as well as downward outcomes. In the 
case of those who start from the very bottom, of course, the outcome is not so 
much assimilating downward as staying where they are. 

The question is what makes some immigrant groups susceptible to the down- 
ward path, or to the permanent trap, and what allows others to avoid it? Major 
determinants can include factors external to a particular immigrant group, such 
as racial stratification, economic opportunities, and spatial segregation, and 
factors intrinsic to the group, such as financial and human capital upon arrival, 
family structure, community organization, and cultural patterns of social rela- 
tions. These two sets of factors affect the life chances of immigrant children 
not only additively but also interactively. Particular patterns of social relations 
in the family or the ethnic community may sometimes counter the trend of 
negative adaptation even in unfavorable situations. When immigrant children 
are under pressure to assimilate but are unsure which direction of assimilation 
is more desirable, the family or the ethnic community can make a difference if 
it is able to mobilize resources to prevent downward assimilation. The focus 
on the interaction between structural factors and sociocultural factors in recent 
research has shed new light on the understanding of the complex process of 
assimilation in the second generation. 
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EMPIRICAL FINDINGS FROM CURRENT RESEARCH 


Unlike adult immigrants whose levels of adaptation are often indicated by occu- 
pational attainment and income, levels of adaptation among young immigrants 
are generally measured by educational attainment—academic orientation, as- 
piration, and performance. Attending school—attaining knowledge and skills 
that may be capitalized upon in future labor markets—is a crucial first step 
toward successful adaptation to American society for immigrant children and 
children of immigrants. In the United States where public education is readily 
available to all children and where education is traditionally accepted as the 
main means of socioeconomic mobility, schooling often comes to occupy a 
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central place in immigrant aspirations (Ogbu 1974). Studies on the new second 
generation have generally concerned a central question: How do various as- 
pects of class, race/ethnicity, social capital derived from particular pattetns of 
social relations within the immigrant family and the ethnic community, inter- 
generational relations, language skills, and ethnic identity affect the process of 
educational attainment of the children of contemporary immigrants? In this 
section, I review how these issues are addressed in recent research and show 
how new concepts are derived from empirical findings. 


Class and Race/Ethnicity 


The socioeconomic circumstances of today’s predominantly non-European- 
American second generation vary by race and ethnicity. Although many of 
them may have never experienced prejudice associated with a particular skin 
color or racial type in their homelands, immigrant children have confronted 
a reality in their host society where their ascribed physical features may be- 
come a handicap (Waters 1994, Portes 1995). Using the 1990 Census data, 
Oropesa & Landale (forthcoming) showed that poverty rates for immigrant 
children ranged from 21% among non-Latino European Americans, 24% for 
non-Latino African Americans, 27% for Asian Americans, and 41% for Latino- 
Americans. Among the second generation (US-born with at least one foreign- 
born parent), there was a substantial drop in poverty rates for all racial groups, 
but the magnitude of the decline varied by race: while poverty rates between the 
first- (or the 1.5-) and second generation dropped more than half among non- 
Latino European-American and Asian-American children, they dropped less 
than a third among non-Latino African-American and Latino-American chil- 
dren. The conditions for third-generation children (US-born children with US- 
born parents) were most disturbing: Except for Asian Americans, there was no 
appreciable socioeconomic improvement between second- and third-generation 
non-Latino European Americans and Latino Americans, but there was a sig- 
nificant deterioration among third-generation non-Latino African Americans 
whose poverty rate jumped up to 40%, a 26 percentage-point increase from that 
of their first-generation counterparts. These statistics reveal an obvious effect 
of race, implying a severe handicap associated with skin colors. 

Why has the class status changed for the better among children of certain 
racial minority groups and for the worse among others with generation? Some 
researchers contend that the inequalities of class and race that plague American 
society are carried into the American educational system. Schools have thus 
become “arenas of injustice" (Keniston et al 1977) that provide unequal oppor- 
tunities on the basis of class and race. Coleman and his associates (1966) found 
that social capital in the form of parental educational attainment and class status 
significantly affect academic performance of children in school and that children 
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did better if they attended schools where classmates were predominantly from 
higher socioeconomic backgrounds. Davis (1993) found that poor African- 
American and Latino-American families who moved from inner-city neighbor- 
hoods did better in school and in labor markets than those left behind. The pat- 
tern generally held true for immigrant children who attended suburban schools. 

Moreover, differences in outcomes of schooling have historically been linked 
to residential segregation on the basis of class and race. Minority children 
have suffered from unequal distribution of economic and educational resources 
that seriously curtail their chances in life and trap them in isolated ghettos. 
Ghettoization, in turn, produces a political atmosphere and a mentality that 
preserve class divisions along racial lines, leading to the greater alienation 
of minority children from American institutions and further diminishing their 
chances for upward mobility (Fainstein 1995). 

But how would one account for the fact that immigrant children tend to 
do better than their US-born peers of similar socioeconomic background and 
attending public schools in the same neighborhood? Ogbu (1974) attributed 
different outcomes to a group’s social status in the receiving society. He dis- 
tinguished between immigrant/voluntary minorities and castelike/involuntary 
minorities. In his line of reasoning, either group members of racial minorities 
could accept an inferior caste status and a sense of basic inferiority as part 
of their collective self-definition, or they could create a positive view of their 
heritage on the basis of cultural and racial distinction, thereby establishing a 
sense of collective dignity (also see De Vos 1975). 

While this was true for both immigrant minorities and caste-like minorities, 
the difference lay in the advantageous or disadvantageous aspects of racial or 
group identity. Ogbu (1989) showed from his research on Chinese-American 
students in Oakland, California, that in spite of cultural and language differences 
and relatively low economic status, these students had grade point averages that 
ranged from 3.0 to 4.0. He attributed their academic success to the integration 
of these students into the family and the community, which placed high values 
on education and held positive attitudes toward public schools. 

The benefits of deliberate cultivation of ethnicity also appeared in the work of 
Gibson (1989), who found that the outstanding performance of Punjabi children 
in a relatively poor rural area of Northern California was a result of parental 
pressure put on children to adhere to their own immigrant families and to avoid 
excessive Americanization. Similarly, Caplan and his associates (1989) found 
that Indochinese refugee children (not including Cambodians and Hmongs) ex- 
celled in the American school system, despite the disadvantaged location of their 
schools and their parents’ lack of education and facility with English. Caplan 
and his associates attributed academic achievement to cultural values and prac- 
tices unique to Indochinese families. Even among Indochinese refugees, the 
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ethnic effect was significant. Rumbaut & Ima (1988) found that Vietnamese 
high school students did much better in both GPAs and test scores than their 
Cambodian and Laotian peers, and that, overall, the strongest predictor of GPA 
was the measure of ethnic resilience. 

More recently, Portes & Rumbaut reported findings from a large random 
sample of second-generation high school students in Florida and southern 
California, showing that parents' socioeconomic status, length of US resi- 
dence, and homework hours significantly affected academic performance, but 
that controlling for these factors did not eliminate the effect of ethnicity (Portes 
& Rumbaut 1996, Rumbaut 1995, 1996). Kao & Tienda (1995) found, based on 
data from the National Education Longitudinal Studies (known as NELS), that 
parental nativity and children's birthplace had different effects on children's aca- 
demic outcomes depending on race and ethnicity. Portes & MacLeod (1996), 
also using NELS, reported that the negative effect of disadvantaged group mem- 
berships among immigrant children was reinforced rather than reduced in sub- 
urban schools, but that the positive effect of advantaged group memberships 
remained significant even in inner-city schools. 

In the most recent research on adolescent development, though originally not 
intending to focus on ethnic differences, Steinberg (1996) revealed a surpris- 
ingly prominent and strong role that ethnicity played in structuring adolescents' 
lives, both in and outside of school. He found that Asian-American students out- 
performed European-American students who, in turn, outperformed African- 
American and Latino-American students by significantly large margins, and 
that the ethnic differences remained marked and consistent across nine dif- 
ferent high schools under study and after controlling for social class, family 
structure, and place of birth of parents. He also found that the ethnic effect 
persists in important explanatory variables of school success, such as the belief 
in the payoff of schooling, attributional styles, and peer groups. Steinberg con- 
cluded that ethnicity emerged as just as important a factor as social class and 
gender in defining and shaping the everyday lives of American children. 

However, the advantage of ethnicity may be limited for caste-like minorities. 
If a socially defined racial minority group wishes to assimilate but finds that 
normal paths of integration are blocked on the basis of race, the group may be 
forced to take alternative survival strategies that enable them to cope psycho- 
logically with racial barriers but that do not necessarily encourage school suc- 
cess. Further, caste-like involuntary minorities may react to racial oppression 
by constructing resistance both as conformity— "unqualified acceptance of the 
ideological realm of the larger society"—and, more frequently, as avoidance— 
“willful rejection of whatever will validate the negative claims of the larger 
society" (Fordham 1996:39). As a consequence, it is the willful refusal to 
learn, not the failure to learn, that affects academic outcomes of the children of 
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caste-like involuntary minorities (Kohl 1994, p. 2). The forced-choice dilemma 
confronting Chicano and Puerto Rican youth is a case in point, Gibson (1989) 
and Bourgois (1991) found that Chicano students and Puerto Rican students 
who did well in school were forcefully excluded by their coethnic peers as 
"turnovers" acting "white." 

Nonetheless, not all immigrant groups can fit into the category of immi- 
grant/voluntary minority. In the case of Dominican immigrants, Pessar (1987) 
noted that many first-generation members of the group were able to improve 
their living standards by pooling resources in their households and that they 
were mostly satisfied with what they had achieved, comparing their lives in 
America to their lives in the Dominican Republic. However, she cast doubt 
on whether the struggle of first-generation immigrants would steer the second 
generation to upholding their parent's aspirations and fulfilling their own ex- 
pectations of socioeconomic mobility. She speculated that Dominican children 
were likely to be frustrated and disappointed if they found themselves trapped 
at the lower rungs of the occupational ladder, because of “blatant discrimina- 
tion" and “lack of access to prestigious social networks" linking them to higher 
professions (1987, pp. 124—125). Portes & Stepick (1993) and Waters (1996) 
also noted such a trend among Haitian youth in Miami and West Indian youth in 
New York City toward rapid assimilation into ghetto youth subcultures, at the 
cost of giving up their immigrant parents’ pride of culture and hopes for mo- 
bility on the basis of ethnic solidarity. This prospect of downward assimilation 
was also expected to disproportionately affect children of Mexican immigrants 
(Perlmann & Waldinger 1996). 

Consequently, subordinate groups may react to their disadvantaged status 
with different strategies: Some may rely on social capital available in their 
own ethnic communily to actively fight for acceptance by the larger society; 
others may consciously reject the ideology and norms of the larger society by 
reconstructing an ethnicity in resistance to the oppressing structure; stil] others 
may give up their struggle and remain trapped in the bottom of the society. 


The Family, the Community, and Networks 
of Social Relations 


In the United States, the family is the most important institutional environment 
outside of school for socialization, adaptation, and the future social mobility of 
children. Success in school, one of the most important indications of adapting 
to society, depends not only on the cognitive ability and motivation of individ- 
ual children, but also on the economic and social resources available to them 
through their families. 

Socioeconomic status is certainly one of the most important characteristics 
of the family context because it influences where children live and where they 
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go to school. While wealthier immigrant families are able to settle directly in 
suburban middle-class communities, most others are forced to settle in declining 
urban areas, starting their American life either in poverty or on welfare as in the 
case of refugees. The low socioeconomic status of those immigrant families 
just arriving subjects children directly to underprivileged segments of the host 
society and the associated disadvantages and pathologies. 

However, socioeconomic status is not all that counts; just as important are 
family structures and their embedded family ties. Recent research has shown 
that immigrant children from intact (especially two-natural-parent) families or 
from families associated with tightly knit social networks consistently show 
better psychological conditions, higher levels of academic achievement, and 
stronger educational aspirations than those in single-parent or socially isolated 
families. Results from the survey of immigrant children in San Diego and Mi- 
ami revealed that, regardless of race/ethnicity, immigrant students who retained 
strong cultural and family identity tended to outpace others in school, including 
their native-born European-American peers, because their immigrant families 
reinforced the values of hard work and educational achievement (Rumbaut 
1994b, 1996, Portes 1995, Portes & Schauffler 1994). The case study on im- 
migrant youth from troubled Central American countries showed that high- 
achievement motivation was significantly related to a strong sense of group 
affiliation, family loyalty, and obligation in helping their less fortunate relatives 
and folks still trapped in war and misery in their homeland (Suárez-Orozco 
1989). The case study of Vietnamese youth in a poor neighborhood in New 
Orleans found that Vietnamese high school students who reported strong ori- 
entations toward traditional family values of obedience, industriousness, and 
helping others were more likely to do well in school than those who did not. 
However, the effects of independent thinking and concern with individual social 
prestige, which were most commonly associated with contemporary American 
society, were insignificant (Zhou & Bankston 1994). These consistent findings 
have highlighted the central role the family plays in the lives of immigrants and 
their children. 

If the presence of both parents at home and well-connected family ties are 
considered sources of social capital, the loss, or truncation, of the family system 
can reduce the access to social resources available to children (Fernández-Kelly 
1995, Rumbaut 1996). Most immigrant households in the United States are nu- 
clear rather than extended families. While migration extends social and famil- 
ial ties across national borders that facilitate further migratory flows (Landale 
1996), it simultaneously disrupts the traditional family system upon arrival both 
in the immigrant fainily itself and in the social network to which the family is 
a part. In some cases, family disruption can cause serious problems for chil- 
dren's upbringing in the American context, when the family is separated by 
national borders and isolated from the ongoing networks of kinship relations 
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in the homeland. The so-called “relayed migration” (Sung 1987) or “serial 
migration” (Waters 1996), a process under which members of an immigrant 
family arrive (both intentionally and mvoluntarily) at different times, can strain 
parent-child and sibling relationships. The truncation of family networks, a sit- 
uation where routine interactions among kins and former neighbors or friends 
are broken, can weaken traditional mechanisms of control and support. 

In other cases, however, immigrant families are able to mobilize ethnic re- 
sources to reconstruct systems of family ties in the United States by shifting 
and expanding the criteria for inclusion in the family circle. Many Vietnamese 
refugee families, for example, are broken down ın fragments of nuclear or ex- 
tended families as a common strategy of flight. These smaller units are some- 
times further fragmented by leaving grandparents and younger children, or 
wives and younger children, behind in the hope of bringing them out after the 
fleeing family is resettled. As a resuli, some of the families may be extended to 
contain distantly related or even unrelated members, while other families may 
have closely related members left behind in Vietnam (Zhou & Bankston, forth- 
coming). Once resettled, however, families, friends, and distant relatives who 
were marginal members of the family circle in Vietnam may become part of 
the active circle of kin relations in the United States (Caplan et al 1989, Kibria 
1993). This reconstructed family pattern has often given rise to a sense of col- 
lective strength in coping and to new mechanisms in reestablishing social ties 
and cooperative kin-based economic practices for immigrants to support one 
another in an alien environment (Kibria 1993, Perez 1994, Zhou & Bankston 
1994). Studies on immigrant families of different ethnic groups have revealed 
similar patterns of family reconstruction (Booth et al 1996). 

Even when immigrant families are maintained by two parents or extended 
kin and by a high level of conformity to traditional values, these families cannot 
function effectively in isolation, especially under unfavorable socioeconomic 
situations when the means of attaining family goals of “making itin America" is 
disrupted by the day-to-day struggle for survival and by the adversarial subcul- 
tures of the underprivileged segment of American society surrounding many 
immigrant families. How is it possible to ensure that immigrants and their 
offspring maintain their values and work habits and learn the skills for socioe- 
conomic advancement? An answer to this question requires something more 
than a check-list of socioeconomic characteristics of individual families. The 
key is to examine the networks of social relations, namely how individual fam- 
ilies are related to one another in the ethnic community and how immigrant 
children are involved in these networks. 

The networks of social relations involve shared obligations, social supports, 
and social controls. When, for example, Korean Americans obtain from other 
Korean Americans low-interest loans requiring little collateral, or Chinese- 
American students receive encouragement and approval in after-school Chinese 
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language classes for their general academic orientations, these are forms of so- 
cial support inherent in particular patterns of social relations within the ethnic 
community. When, on the other hand, a group member experiences disapproval, 
or even ostracism, from co-ethnics for failing to attain a respected occupation, 
this is a form of social control. Zhou & Bankston (1994, forthcoming) proposed 
a model of ethnic social relations based on their study of Vietnamese adoles- 
cents in New Orleans. In the Vietnamese community in New Orleans, they 
observed that Vietnamese adolescents were constantly reminded of their duty 
to show respect for their elders, to take care of younger siblings, to work hard, 
and to make decisions upon approval of parents not simply within a particu- 
lar family but in the community where other families practiced similar values. 
In this “watchful and ever-vigilant" community, young Vietnamese found little 
competition from other desiderata because the social world of their families was 
restricted to the closed and highly integrated circles of the ethnic group. Since 
what was considered good or bad was clearly specified and closely monitored 
by these networks, young people found it hard to “to get away with much.” 
The researchers concluded that the conformity to traditional family values and 
behavioral standards required a high level of family integration into a commu- 
nity that reinforced these values and standards. The outcomes of adaptation, 
therefore, depend on how immigrant children fit in their own ethnic commu- 
nity, or in their local environment if such an ethnic community is absent, and 
how their ethnic community or the local environment fit in the larger American 
society. In the case of the Vietnamese, being part of a Vietnamese network 
appears to offer a better route to upward mobility than being Americanized into 
the underprivileged local environment, or for that matter, into the native-born 
mainstream youth subcultures. 

While family ties function as an important source of support and control, 
recent research has found evidence to indicate that the cohesion of family ties 
tends to deteriorate with longer duration of US residence, as in the case of 
refugees from Central America (Gil & Vega 1996). Researchers have also 
cautioned that even strong cultural identities and social ties, which may be con- 
sidered as sources of social capital, may sometimes be insufficient because of 
racial or class disadvantages. In a study of a ghetto African-American commu- 
nity, Stack (1974) showed that African-American families depended on patterns 
of co-residence, kinship-based exchange networks for survival. This means of 
survival, however, demanded the sacrifice of upward mobility and geographic 
movement, and it discouraged marriage, because of structural constraints such 
as the inexorable unemployment of African-American women and men. Wel- 
fare policies disrupted the support networks and conspired against the ability 
of the poor to build up equity. Similarly, Fernández-Kelly (1995) found, in 
a study of teenage pregnancies in a Baltimore ghetto, that kinship networks 
in ghettos were often graced with strong family and friendship bonds but that 
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these networks lacked connections to other social networks that controlled ac- 
cess to larger sets of opportunities. Moreover, symbols of ethnic pride and 
cultural identity that developed in reaction to social isolation and racial domi- 
nation (e.g. the sparkling mounds of braided hair of young African-American 
women) became signals that barred access to resources and employment in the 
larger society. Such truncated networks and reactive ethnicity could severely 
limit the ability of children to envision alternative paths out of the ghetto and 
to turn cultural capital into resourceful social capital (Fernández-Kelly 1995, 
Fordham 1996, Kohl 1994). 


Intergenerational Relations 


The clash between two social worlds is the most commonly cited problem of 
intergenerational relations. In fact, intergenerational conflicts are not simply 
a unique immigrant phenomenon (Berrol 1995, Child 1943); they are also an 
American phenomenon rooted in the American tradition of a “moral rejection 
of authority” (Gorer 1963:53). In a recent study of Latino adolescents, Suárez- 
Orozco & Suárez-Orozco (1995) found that intergenerational conflicts are more 
common among European-American adolescents who are more ambivalent 
toward authority and schooling and are more peer-oriented than among Latino- 
American adolescents who are more respectful of authority and more family- 
oriented. They attribute this gap to the impact of changing American youth 
culture that glorifies contempt for authorities and a peer orientation, implying 
that assimilating into the American youth culture may cause more harm than 
good for immigrant adolescents. 

In the United States, immigrant children often become Americanized so 
quickly that their parents cannot keep up with them. There is a fear in the 
older generation that their children will leave them, become like other Ameri- 
can youth, and forget about their roots. This fear, however, has originated not 
from the process of acculturation but from the migration process itself. Mi- 
gration disrupts normal parent-child relationships in a number of observable 
ways, shown in a number of case studies and surveys on immigrant children 
(Berrol 1995, Kibria 1993, Rumbaut 1994b, Sung 1987, Waters 1996, Zhou 
1997). First, many immigrant families suffer lengthy separation from the fa- 
ther or mother or older siblings because of delayed reunification. When all the 
members are reunited, they have to make an effort to adapt to each other in a 
new situation. Second, working outside the home has become the norm and an 
economic necessity for women. This work role gives women some measures 
of independence but creates difficulty in child-rearing and weakens the status 
and authority of men. Also, face-to-face interaction between parents and chil- 
dren decreases as both parents are usually out working for long hours. Third, 
because parents lack proficiency in English, children often act as interpreters 
and translators for their parents. Such role reversal usually leads to greater 
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dependence of parents on children and a loss of parental authority. Meanwhile 
members of the younger generation are anxious that they might never become 
“American” because of these intrinsic family ties. 

Immigrant children and their parents tend to perceive their host society and 
their relationships with it from different angles. The younger generation tends 
to focus on current adjustment, paying attention to the external traits of what 
they have come to define as being “American.” They struggle to fit in based 
on a frame of reference that they have acquired from their American peers and 
from television and other forms of mass media. They often find themselves 
confused by such questions as: How do I fit into American culture and my own 
ethnic culture at the same time? Which side should I stay loyal to, American or 
my own ethnic culture? Can I ever become American without leaving home? 
At times, they feel embarrassed by their parents’ “old” ways. 

Parents, on the other hand, are primarily concerned both with making the best 
of a new environment and with retaining traditional family life. These parental 
concerns tend to lead them to focus on the future and to emphasize discipline 
and scholastic achievement. When children respond to these emphases in an 
unexpected way, parents puzzle: Why are my children so disrespectful? How 
can I make my children understand that everything I am doing is for their own 
good? Can’t they understand that I wouldn’t have chosen a life here if it hadn’t 
been for them? What should I do to keep my children from losing their cultural 
roots and from assimilating too much? 

The frequent difficulties facing the new second generation arise from the 
struggles of individuals to balance the demands of American culture with those 
of tradition-minded parents (Dublin 1996). Portes & Rumbaut (1996, Chap- 
ter 7) conceptualize the acculturation gaps between immigrant parents and 
their children in a typology of “generational consonance versus dissonance.” 
Generational consonance occurs when parents and children both remain unac- 
culturated, or both acculturate at the same rate, or both agree on selective ac- 
culturation. Generational dissonance occurs when children neither correspond 
to levels of parental acculturation nor conform to parental guidance, leading 
to role reversal and intensified parent-child conflicts. According to Portes & 
Rumbaut, these acculturation patterns interact with contextual factors—racial 
discrimination, urban subcultures, and labor market prospects—to affect adap- 
tational outcomes of children. When contextual factors are unfavorable, as is 
the case confronting the majority of today’s second generation, consonant ac- 
culturation enables immigrant children to lean on material or moral resources 
available in the family and the immigrant community; it thus increases the 
probability of upward assimilation. On the contrary, dissonant acculturation 
severs ties between children and their adult social world, deprives children of 
family or community resources, and leads them farther and farther away from 
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parental expectations. In this situation, immigrant children are likely to rebel 
against parental educational expectations and to assimilate into an adversarial 
academic orientation in response to discrimination, subcultural pressures, and 
blocked mobility, as exemplified by Haitian children in Miami and West Indian 
children in New York City (Portes & Stepick 1993, Waters 1994, 1996). 

What determines intergenerational conflicts? Using a large random sample 
of over 5000 immigrant children in San Diego and Miami, Rumbaut (1994b) 
examined the possible effects of a number of objective and subjective pre- 
dictors that measure children’s demographic characteristics, family situations, 
language use, academic performance, and discrimination. He found that parent- 
child conflicts were significantly less likely to occur in families with both natural 
parents at home and with parents or siblings readily available to offer help with 
homework. But among females, conflicts were significantly more likely to occur 
in families where the mother was less educated and where economic well-being 
was perceived as having worsened, where children felt embarrassed by parents 
and had nobody to help with homework at home. Tensions were also likely to 
be exacerbated among children who preferred to speak English at home, who 
had low GPAs and educational aspirations, who spent much time watching tele- 
vision and too little time on homework, and who experienced discrimination or 
perceived themselves as being discriminated against. 

Intergenerational conflicts, in turn, lead to dwindling parental authority and 
insufficient family communications and have significantly negative effects on 
children’s self-esteem, psychosocial well-being, and academic aspirations (Gil 
& Vega 1996, Rumbaut 1996, Szapocznik & Hernandez 1988). However, 
whether the effects on children’s adaptation are deleterious also depends on 
the specific context of exit from countries of origin and the context of reception 
upon arrival (Portes & Rumbaut 1996). Several studies on immigrant children 
have reported that intergenerational conflicts within families do not necessarily 
frustrate successful adaptation to a host society (Schulz 1983). The ethnic social 
structure can sometimes play an important role in mediating between individual 
families and the larger social setting. Immigrant children and parents often in- 
teract with one another in immigrant communities. If patterns of interaction are 
contained within a tightly knit ethnic community, these children and parents are 
likely to share their similar experiences with other children and parents. In this 
way, the community creates a buffer zone to ease the tension between individual 
self-fulfillment and family commitment. The community also serves to moder- 
ate original cultural patterns, to legitimize reestablished values and norms, and 
to enforce consistent standards. This situation resembles Sung’s description of 
immigrant children in New York’s Chinatown in the mid-1980s. Sung observed: 


For Chinese immigrant children who live in New York's Chinatown or in satellite China- 
towns, these [bi-cultural] conflicts are moderated to a large degree because there are other 
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Chinese children around to mitigate the dilemmas that they encounter When they are 
among their own, the Chinese ways are better known and better accepted. The Chinese 
customs and traditions are not denigrated to the degree that they would be 1f the immigrant 
child were the only one to face the conflict on his or her own (1987, p. 126). 


Sung’s finding suggests that frequent interactions with co-ethnics within an 
ethnic community could help young ethnic group members to develop a sense 
of identity that would ease bicultural conflicts. Similarly, in their case study 
of the Vietnamese in New Orleans, Zhou & Bankston (forthcoming) found 
that the bicultural tension did not produce rebellion; on the contrary, being both 
Vietnamese and American frequently caused children to achieve superior levels 
of performance. They reasoned that, first, parents did not just bring with them 
a desire to maintain traditional cultural patterns. The parents also, as a result 
of the process of migration and their own struggles for survival in a new land, 
developed a strong orientation toward upward mobility. Beliefs about parent- 
child relations were combined with this mobility orientation and modified by 
it Second, parents were not alone in their efforts to control and guide their 
children. They maintained relations in closely knit ethnic networks that served 
to reinforce parental expectations and acted as bridges between Vietnamese 
family life and the surrounding American society. 

Zhou & Bankston also analyzed in depth the pattern of gender role change to 
show how the immigrant orientation toward upward mobility could perpetuate 
and modify traditional cultural patterns. They found that in the case of the 
Vietnamese the older generation was willing to modify the original culture to 
adjust to life in the host society. Vietnamese Americans still believe that women 
should be subject to higher levels of social control and parental authority than 
men are. But the mobility orientation of Vietnamese Americans means that so- 
cial control is no longer a matter of preventing women from acquiring education 
but rather it acts to push them toward educational excellence. Paradoxically, 
the young women who are more controlled by the family and the ethnic com- 
munity show higher levels of adaptatión to the American school environment 
(Forthcoming, Chapter 8). 

These case studies of the Chinese in New York and the Vietnamese in New 
Orleans highlight the important role the ethnic community can play in channel- 
ing frustrations with cultural conflicts away from rebellion and instead turning 
cultural tensions into pressures for achievement. Whether results from these 
cases are generalizable across immigrant groups is inconclusive, however. 


Patterns of Language Acquisition and Language Use 

Linguistic adaptation is another challenge for the new second generation. Pro- 
ficiency in English has been regarded as the single most important prerequisite 
for assimilation into American society and as a strong social force binding the 
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American people together. Lack of English proficiency, aggravated by the prob- 
lem of linguistic isolation and disadvantages associated with minority status, 
has been a severe handicap for new immigrants and their children. Geograph- 
ically, some immigrant children are concentrated in linguistically distinctive 
neighborhoods where their native tongue is used more commonly than English 
by the people around them. This leaves immigrants in direct daily contact 
only with other immigrants or with members of native minorities rather than 
with members of the dominant majonty, who have moved away from areas of 
immigrant concentration. At the school level, immigrant children often find 
themselves in classrooms with other immigrant children speaking a language 
other than English or with other native minority children with distinct inner- 
city accents, The Los Angeles Unified School District has recently identified 
87% of the district’s students as “minority” and 40% as having limited English 
proficiency (LEP) (Lopez 1996). Meanwhile, immigrant children constantly 
face the pressure to become proficient in English, from the pervasive pull of 
American popular culture and the media, from their schoo] teachers, and from 
their often non-English speaking parents, who at the same time pressure them 
to preserve their ancestral tongues. 

The issue of language, then, is one that must be addressed in any examina- 
tion of the adaptation of immigrant children. Does the language problem con- 
fronting immigrant children hinder their assimilation into American society? 
Does the maintenance of parents’ native tongues necessarily lead to unfavor- 
able outcomes as these children adapt to school and grow into adulthood? The 
doctrine of “forcible assimilation” insists that English language skills compete 
with non-English language skills; that bilingualism causes academic failure, 
anxiety, and mental confusion of immigrant exposed to two languages; and 
that non-English skills should be wholeheartedly abandoned. Associated with 
this ideology has been a “sink or swim” linguistic policy in American public 
schools. Such a policy holds that children from non-English speaking homes 
should be placed in school environments where only English is tolerated. 

In recent years, adherents of “forcible assimilation” have tended to advance 
a sociological, rather than a psychological, argument for monolingualism in 
schools. Advocates of the “English Only” movement maintain that bilingual- 
ism can inhibit social adaptation in a predominantly English-speaking society, 
creating a "new apartheid.” This position, however, has been supported more 
often by rhetorical skills than by empirical evidence. One exception was a 
literature review of research on bilingualism from the US Department of Edu- 
cation (Baker & de Kanter 1981). This report reviewed past studies on bilingual 
education and evaluated the evidence on the advantages of bilingualism. It con- 
cluded that bilingually educated students score below average in both English 
skills and general academic achievement. 
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Unlike forcible assimilation, “reluctant bilingualism” is tolerant of bilingual- 
ism and supports its use in schools. However, the reluctant bilingualists endorse 
the use of non-English language only as a strategy for achieving the ultimate 
goal of linguistic assimilation. Reluctant bilingualists advocate programs of 
“transitional bilingual education.” Transitional bilingualism is established as a 
stated policy of the US Government by the Bilingual Education Act of 1968. 
The goal is to help students keep up with reading, math, and other subjects in 
their native tongues while they are taught enough English to transfer to regular 
classrooms. The native language, from this view, is also treated as the source 
of a hindrance rather than as an asset. 

The debate over the language issue has largely ignored the fact that language 
acquisition and language use are constrained by contextual factors as well as 
individual preferences. Lopez (1996) found that English monolingualism at 
home increased from one generation to the next, and that the shift was more 
rapid in the third generation than in the second generation. However, such a 
shift was more rapid among Asian-Americans than among Latino Americans 
across generations. Likewise, the shift from bilingualism (proficiency in both 
English and native language) to English monolingualism in the third generation 
was more substantial among Asian-Americans than among Latino Americans. 
He pointed out that the better maintenance of Spanish among Latino Americans, 
especially in the Southwest, is attributed to the proximity to Mexico, residential 
isolation from the dominant group, and ethnic concentration in the sense of sheer 
numbers. 

How do language patterns shift and how does language use affect adapta- 
tional outcomes of immigrant children? The negative view of parental native 
languages has been challenged by studies on immigrant children since the 1960s 
(Portes & Rumbaut 1996, Chapter 6). A growing body of empirical evidence 
indicates that both cognitive abilities and scholastic achievement are actually 
positively associated with bilingualism. Tienda (1984) showed that the reten- 
tion of Spanish proficiency did not hinder the socioeconomic achievement of 
Hispanic-American men, and that, at the minimum, bilingual education would 
notretard lifelong achievement. Fernández & Nielsen (1986) provided evidence 
from national logitudinal data that, among Hispanic- and European-American 
high school students, proficiency in both English and parental native languages 
was positively related to academic achievement. Matute-Bianchi (1986) found 
in an ethnographic study of Mexican-American children that advanced bilin- 
gual skills were related to a strong Mexican identity and that fully bilingual 
young Mexican-Americans tended to perform better in school than those who 
lacked proficient bilingual skills. She concluded that proficiency in the native 
language allows young people to gain greater access to the emotional and nor- 
mative supports of the ethnic group. In a study of Chinese immigrant children 
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in New York City’s Chinatown, Sung (1987) found that bilingual students had 
higher student retention rates, more graduates, and higher self-esteem. She 
suggested that these positive outcomes were associated with the acceptance of 
distinctive ethnicity. Other researchers have found that language maintenance 
bilingual programs, as opposed to transitional bilingual programs, helped stu- 
dents learn the language of the dominant society effectively (Cazden & Snow 
1990, Cummins 1980, 1981, Bhatnager 1980). 

A recent two-period (1986-1987 and 1989-1990) study of the entire high 
school student cohorts in San Diego Unified School District (the nation’s eighth 
largest) reported that, with the main exception of some Hispanic students who 
were generally of much lower socioeconomic status, all of the non-English 
immigrant minorities were outperforming their English-only co-ethnics as well 
as majority European-American students. This finding applied to both bilin- 
guals (designated by the District as having fluent English proficiency—FEP) 
and semi-bilinguals (designated as having limited English proficiency—LEP), 
though FEP students did better than their LEP coethnics. In standardized tests, 
while English monolinguals tended to have the highest scores in reading com- 
prehension, FEP students showed significantly higher scores in math than 
English monolinguals, for every nationality. The same study also reported that 
Vietnamese students, who are mostly members of the one-and-a-half genera- 
tion, were among the ethnic groups with the highest GPAs, trailing only after 
Chinese, Koreans, and Japanese. In terms of dropout rates, however, the re- 
sults were mixed, While FEP, or truly bilingual, students consistently showed 
much lower dropout rates than English monolingual students across all ethnic 
groups, LEP students were observed to be at the greatest risk of dropping out 
(Rumbaut 1995, Rumbaut & Ima 1988). Bankston & Zhou (1995) reported 
similar findings concerning the effect of literacy 1n the parental native language 
from their study of Vietnamese high school students in New Orleans. 

Moreover, the ethnic language is intrinsic to ethnicity. Under certain condi- 
tions, it allows immigrant children to gain access to some kind of social capital 
generated from a distinctive ethnic identity, such as support and control from 
bilingual or non-English-speaking parents and ethnic communities. Bankston 
& Zhou (1995) found that literacy in the ethnic language was strongly associated 
with ethnic self-identification, which in turn, contributed to academic excel- 
lence. They concluded that advanced ethnic language abilities, such as literacy, 
were related to achievement because ethnic language skills tied immigrant chil- 
dren more closely to their traditions, their families, and their communities that 
enforce the values of academic achievement. 

However, not all immigrant children can benefit from bilingualism because 
patterns of language use are affected by class, race/ethnicity, length of US resi- 
dence, and other contextual factors. In an earlier study, Lopez (1976) found that 
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the use of Spanish depressed occupational attainment indirectly by lowering ed- 
ucational achievement among Chicanos because Chicanos concentrated in areas 
and occupations that allow few returns to the knowledge of Spanish. Fernandez 
and Nielsen (1986) also reported that the positive effect of bilingualism tended 
to diminish with longer duration of US residence, and that the frequent use of 
Spanish was negatively related to academic achievement after controlling for 
English and Spanish proficiency. They attributed the contradictory outcomes 
to a specific handicap associated with Hispanic membership. 

The language issue, therefore, is not just an linguistic issue but has deep- 
rooted sociological implications. Of course, we cannot entirely discount the 
possibility that the effect of ethnic language on overall academic achievement 
may be due, in part, to a transference of cognitive development: Skills developed 
in learning to read the parental language may be transferred to other areas of 
intellectual endeavor, such as history, geography, or mathematics. Nevertheless, 
the acquisition of English and the maintenance of parental native languages do 
not function in isolation from social contexts (also see Bialystok & Hakuta 
1994). 


CONCLUSION 


For immigrant children and children of immigrants, growing up American can 
be a matter of smooth acceptance or of traumatic confrontation. Immigrant 
children are generally eager to embrace American culture and to acquire an 
American identity by becoming indistinguishable from their American peers. 
In some cases, however, they may be perceived as “unassimilated” even when 
they try hard to abandon their own ethnic identities. In other cases, they may be 
accepted as well-adjusted precisely because they retain strong ethnic identities. 
In the long journey to becoming American, the progress of today's one-and- 
a-half and second generation is largely contingent upon human and financial 
capital that their immigrant parents bring along, the social conditions from 
which their families exit as well as the context that receives them, and their cul- 
tural patterns, including values, family relations, and social ties, reconstructed 
in the process of adaptation. The host society offers uneven possibilities to 
different immigrant groups. These unequal possibilities may limit the oppor- 
tunities of immigrant groups, but they do not necessarily constitute a complete 
denial of opportunity. 

Immigrants are today being absorbed by different segments of American so- 
ciety, but becoming American may not always be an advantage for immigrant 
children and children of immigrants. When immigrants enter middle-class com- 
munities directly, or after a short transition, it may be advantageous for them to 
acculturate and assimilate. When they enter the bottom of the ethnic hierarchy 
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of drastic social inequality, the forces of assimilation come mainly from the 
underprivileged segments of this structure, and this is likely to result in distinct 
disadvantages, viewed as maladjustment by both mainstream society and the 
ethnic community. In this case, young immigrants or children of immigrants 
may benefit by cultivating their ethnic ties in their ethnic communities to de- 
velop forms of behavior likely to break the cycle of disadvantage and to lead 
to upward mobility. 

The interest in immigrant children and children of immigrants has recently 
been growing. However, there is still a big gap between the strategic impor- 
tance of the new second generation and current knowledge about its conditions 
(Portes 1996). Data on which the existing body of research is based come 
mostly from regional survey research and ethnographic studies on selected 
immigrant groups. Census data sources have been, or are being, scrutinized 
by some researchers to describe the current state of immigrant children, their 
geographic distribution and demographic and socioeconomic characteristics, 
school attendance, fertility patterns, labor market opportunities facing entrants 
to the labor force, and the establishment of independent households (Hirschman 
1994, Jensen & Chitose 1994, Mcllenkopf et al 1995, Landale & Oropesa 
1995, Zhou & Bankston, forthcoming). À major drawback of the census data 
(1980 and 1990) is that a critical variable—the birthplace of parents—has been 
dropped from the decennial census since 1980, making it impossible to identify 
directly the children of immigrants (Hirschman 1994). Researchers have to use 
the ancestry question as a proxy. This treatment of ethnic origin variable is 
problematic. Perlmann & Waldinger (1996) note that, because of high rates of 
intermarriages in the third generation, the respondent's choice of ethnic identity 
is selective, making it difficult to accurately predict the independent effect of 
ethnic origin on intergenerational mobility. 

Moreover, the census data do nct have any direct measures for contextual 
effects of the family, the school, the neighborhood, and the ethnic community, 
nor do they have detailed information on school performance. There are a 
few other national surveys that offer important data that the census lack, such 
as NELS and the National Longitudinal Study of Adolescent Health (known 
as "Add HEALTH"). These data sets have over-sampled some minority and 
immigrant groups and have detailed information about contextual influences 
of the family, the school, and the community on adolescent health, behavior, 
family life, peer relationships, goals, aspirations, academic performance, and 
related variables. However, they do not contain viable subsamples of the most 
recently arrived national-origin groups within broader regional categories to 
conduct comparative analyses. 

For further theoretical inquiry, the following questions may offer some stim- 
ulus: Will members of a generation born or reared in the United States gradually 
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be pulled away from a heritage vastly different from those of the Europeans 
who arrived over the course of this century? Will those who rebel against this 
heritage be the best-adjusted, socially and economically? Will racial barriers 
limit the participation of immigrant children in American life? How would 
being hyphenated Americans influence the ways in which immigrant children 
become assimilated, and why may some of these ways be more advantageous 
than others? Will immigrant families and ethnic communities persist in affect- 
ing the lives of children of the second generation? Will cultural distinctiveness 
of hyphenated Americans eventually melt down into a pot of Anglo-American 
homogeneity? If not, what will ethnic diversity mean for the offspring of today’s 
new second generation? Each of these questions has theoretical as well as prac- 
tical implications. Given the unique characteristics of and the scanty knowledge 
about the complex ways in which the second generation of new immigrants are 
“becoming American,” future studies are both urgent and necessary. 
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ABSTRACT 

Feminist theories in sociology reflect the rich diversity of general theoretical 
orientations in our discipline; there 1s no one form of feminist theory. The de- 
velopment of these theories over the last 25 years has only recently begun to 
influence the mainstream theory canon, which has much to learn from their in- 
sights. This chapter demonstrates why feminist versions of the following the- 
ory types should be more fully integrated into mainstream sociological theory: 
neo-Marxist, macro-structural, exchange, rational choice, network, status expec- 
tations, symbolic interactionist, ethnomethodological, neo-Freudian, and social 
role. Feminist standpoint theory, an epistemological critique of mainstream so- 
ciology, is discussed at the beginning, and the chapter concludes with a brief 
account of the newly developing effort to theorize the intersection of race, class, 
and gender. 


INTRODUCTION 


The term “feminist theory” is used to refer to a myriad of kinds of works, pro- 
duced by movement activists and scholars in a variety of disciplines; these are 
not mutually exclusive and include: (a) normative discussions of how societies 
and relationships ought to be structured, their current inequities, and strategies to 
achieve equity; (b) critiques of androcentric classical theories, concepts, episte- 
mologies, and assumptions; (c) epistemological discussions of what constitute 
appropriate forms, subject matters, and techniques of theorizing from a feminist 
perspective; and (d ) explanatory theories of the relationship between gender and 
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various social, cultural, economic, psychological, and political structures and 
processes. Much of this work is explicitly interdisciplinary in inspiration and 
intended audience. To complicate matters further, there is no consensus on the 
exact meaning of the word "feminist," which makes it difficult to distinguish 
with precision between theoretical material that pertains to gender (e.g. Parsons 
1949, 1955, which no one would label feminist) and gender-related theory that 
is specifically "feminist" Finally, there is little consensus among feminist so- 
ciologists about the basic theoretical questions that require an answer, resulting 
in the proliferation of theories at a low level of abstraction that explain specific 
phenomena (e.g. pay inequity), in addition to more abstract, general works. 

To remain within the limits of one chapter, I confine this review in several 
ways, beginning by excluding feminist theory that has not been produced or used 
extensively by sociologists. While feminist theory is often defined as ^women- 
centered" (e.g. Lengermann & Niebrugge 1996:436; Smith 1979, 1987; Alway 
1995), I use a definition that focuses more broadly on gender, yet maintains the 
normative emphasis implied by all definitions of the term feminist, which thus 
enables one to distinguish feminist from other gender-relevant theory. Earlier 
(Chafetz 1988:5), I defined it in terms of four criteria, which guide my selection 
of theories to be reviewed in this chapter: (a) "gender comprises a central 
focus or subject matter of the theory"; (b) "gender relations are viewed as a 
problem .... [F]eminist theory seeks to understand how gender is related 
to social inequities, strains, and contradictions"; (c) "gender relations are not 
viewed as. . . immutable”; and (d) feminist theory “can be used . . . to challenge, 
counteract, or change a status quo that disadvantages or devalues women.” I 
focus most of my attention on explanatory theories but eschew discussion of 
the numerous, more substantively narrow ones. Finally, I limit this review to 
writings produced since the broad-scale reemergence of feminist consciousness 
and activism in the late 1960s, which began to affect sociological discourse after 
about 1970. This limitation does not mean, however, that no works that could 
be considered feminist theory were produced by sociologists before this time 
(see Fitzpatrick 1990, Deegan 1988, Donovan 1985, Rosenberg 1982). 

A decade ago, Stacey & Thorne (1985; also Laslett et al 1992, Alway 1995) 
bemoaned the failure of sociology in general, and sociological theory in partic- 
ular, to incorporate feminist insights as central components of its work. During 
the last two decades, a significant amount of feminist sociological theory and 
epistemology has been produced, as reflected in two books that provide broad 
overviews (my undergraduate text, Chafetz 1988; a more sophisticated book 
edited by England 19932), and one less comprehensive collection of theoretical 
papers (Wallace 1989). Several interdisciplinary feminist theory books, which 
incorporate some sociological theory, have also appeared (e.g. Tong 1989, 
Gergen 1988, Rhode 1990). Recent theory textbooks in sociology (e.g. Ritzer 
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1996, Etzkowitz & Glassman 1991, Waters 1994, Wallace & Wolf 1995) have 
begun to include some discussion of feminist theory, but they vary widely in the 
extent and nature of their coverage of the topic. These texts typically confine 
discussion of feminist theory to its own chapter or chapter section. This prac- 
tice is problematic both because it allows scholars and students to easily skip 
the topic and because it makes the contributions of feminist theorists appear 
more narrow and homogeneous than they are. In addition, many texts still omit 
feminist theory altogether, and a number of important contemporary theorists 
(e.g., James Coleman, Jeffrey Alexander, Peter Berger, Anthony Giddens) have 
ignored both feminist theoretical insights and the very topic of gender in their 
“general” theories (Seidman 1994:304). 

Some feminists focus on those contemporary theories and texts that ignore 
the contributions of feminist theories and the topic of gender and conclude that 
feminist contributions remain largely ghettoized within our discipline (e.g. Ward 
& Grant 1991, Alway 1995). My view is that, while progress has been made 
in integrating feminist concerns and insights into the discipline’s theoretical 
discourse, much work remains to be done. This chapter demonstrates the 
abundance and variety of feminist theoretical insights that can and already have 
to some extent contributed to a more robust theoretical understanding of social 
life, one which reflects the centrality of gender in virtually all sociocultural 
contexts. It also demonstrates that feminist theories emanate from, critique, and 
revise the rich array of theoretical traditions that define our discipline. Space 
limitations preclude much discussion of precisely how feminist theories can be 
better integrated with mainstream ones. Rather, I focus attention primarily on 
reviewing the central insights of feminist theories in order to better inform those 
sociologists who may be unfamiliar with much of this body of work about the 
rich array of theoretical ideas that are at their disposal. 


EPISTEMOLOGICAL ISSUES 


Much of the literature that is labeled "feminist theory” consists of epistemology 
and epistemological critiques of “malestream” sociology. Its foundations reflect 
several nonfeminist traditions, especially Marx’s and Mannheim’s discussions 
of ideology, Foucault’s work on knowledge and power, and phenomenological 
and ethnomethodological approaches, the exact mix of influences varying by 
author. While this work makes important contributions to these traditions, for 
two reasons I believe that it is a misnomer to call this work feminist epistemology 
(or theory). First, the issues raised are not in any fundamental way different 
from those raised by many scholars who have worked in these traditions but have 
not been interested specifically in women or committed to feminism. Feminists 
extend their insights ın important ways, but this does not constitute a uniquely 
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feminist approach to sociology. Second, many women in sociology, whose 
scholarship they and others consider as well within the feminist tradition, do 
not agree with this perspective. 

Feminist scholars in a number of disciplines critique what they define as 
mainstream, "masculinist," “objectivist,” and “positivist” social science, and 
develop a “feminist” alternative called standpoint theory. In sociology, the 
two most widely cited are Dorothy Smith (especially 1987, 1990, also 1979, 
1989) and Patricia Hill Collins (especially 1990, also 1986, 1989), whose basic 
ideas constitute the focus of this section (see also Harding 1986, 1991). Where 
Smith focuses on developing a “woman’s standpoint,’ Collins’ work is directed 
at an Afrocentric feminist standpoint epistemology. Smith explicitly locates 
the origins of her ideas in Marx, Foucault, and ethnomethodology, Collins 
primarily in Mannheim, several well-known contemporary feminist theorists in 
diverse disciplines (Chodorow 1974, 1978, Gilligan 1982, Harding 1986, Jaggar 
1983, hooks 1981, Smith 1987, Harstock 1983, 1985), and a myriad of mostly 
female, African-American thinkers. The ideas expressed by these two scholars 
incorporate those of a large number of others (see Sprague & Zimmerman 1993 
for a sympathetic yet critical discussion of standpoint theory, and Laslett et al 
1992 for a review of Smith’s work). 


The Critique of Mainstream Sociology 


Smith and Collins (also Cook & Fonow 1986, Farganis 1986, Haraway 1988, 
in addition to those cited above) begin with the idea that all knowledge about 
the social world reflects the social position(s) of the knower and, therefore, at 
best can result in no more than a partial understanding of that world; there is no 
Archimedian perspective outside of one’s socially constituted standpoint. Un- 
like some feminists who argue for the superiority of a woman’s standpoint as dis- 
empowered “outsider,” Smith and Collins explicitly recognize that a woman’s 
or a feminist standpoint is no less situated and partial than those they critique. 
For Marx and Mannheim, the “standpoint” of a knower is defined in terms 
of social class. Smith adds gender and Collins adds race and gender to class 
in defining the chief dimensions of those standpoints. They begin their cri- 
tique of accepted sociological knowledge on the basis that, until recently, the 
knowers had one common standpoint—that of white, middle class male; other 
standpoints have been effectively silenced as contributors of "credible" social 
scientific knowledge. Virtually all feminist scholars (and many others) agree 
that by diversifying the kinds of knowers in sociology, new questions are raised 
about social life, new data sought to answer them, and new interpretations of re- 
ceived wisdom are proffered. In short, different standpoints lead to differences 
in what scholars think about. However, for Collins and Smith this is merely the 
starting point in their critique of mainstream sociology. 
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Collins and Smith recognize the diversity of experiences, hence standpoints, 
among women. Nonetheless, their logics assume that each gender has a stand- 
point and that it results in profound differences between the ways women and 
men think, in addition to what they think about. Smith (1990) refers to “male- 
created discourse" and talks about male thinking as “objectifying” experience, 
thereby creating "lines of fault" between women's subjective experiences and 
the way women sociclogists write about them (if they conform to the stan- 
dards of the discipline), resulting in a “bifurcated consciousness" Women's 
everyday world is alienated and objectified by the very categories of analysis 
that, as sociologists, they are taught and expected to bring to their work, cate- 
gories that reflect and support the “sccial relations of ruling" that oppress them. 
Collins (see also Dill 1979, 1983, King 1988) speaks of a “distinctive Afrocen- 
tric women’s culture” of resistance (p. 11) which represents “the simultaneity 
of race, class and gender oppression” in a “matrix of domination.” This leads 
them to reject white male “either/or” thinking, rather opting for a "both/and" 
orientation to intellectual (as well as practical) life. Collins describes “posi- 
tivism" as “a Euroceniric masculinist epistemology” that is highly problematic 
for understanding African-American women’s lives. She defines positivism as 
attempts to produce “objective generalizations” (p. 205): "[Scientists] aim to 
distance themselves from the values, vested interests, and emotions ... [and 
thus] decontextualize themselves [in order to become] detached observers and 
manipulators of nature." She criticizes this epistemology not only because it 
treats the subjects of research as objects (as does Smith), but also because of 
the absence of emotion, ethics, and values, and because of the preferred style 
of "adversarial debate" in establishing knowledge claims. Like Smith, Collins 
concludes that this mode of thinking by sociologists fosters a system that op- 
presses Black women. 

To Smith and Collins, the taken-for-granted concepts, language and style 
of writing and of making truth claims in sociology are male-created, alien to 
women, and function to support patriarchy, specifically, “relations of ruling" 
(Smith) or "the matrix of domination" (Collins) more generally. It is here that 
other feminist sociologists (including this author) part company with standpoint 
theory, unconvinced that men's and women's ways of thinking categorically 
differ or that the dominant ways of doing sociology are inherently masculine 
and necessarily antithetical to feminists' concerns (see especially Coser 1989). 


The Proposed Alternative 

Smith and Collins share a radical empiricism; feminist sociologists should es- 
chew the standard, "masculinist" conceptual tools of the field and begin with 
immersion in women's experiences of everyday life. For Collins, an Afrocentric 
feminist epistemology uses the scholar’s “own concrete experiences as situated 


102 CHAFETZ 


knowers in order to express a Black women's standpoint" (p. 16). Smith recom- 
mends that we explore the world as "insiders" by making "the everyday world 
problematic" (1990, p. 26). In a tour-de-force examination of textual material, 
smith (1990) demonstrates how the data used by sociologists are prepackaged 
by agencies and other professionals (e.g. physicians, police and courts, social 
workers) in ways that express and reinforce the relations of ruling and there- 
fore must be eschewed in favor of examination of the direct experiences of the 
people whose lives we seek to understand. 

Smith appears to assume that the process of thoroughly critiquing and decon- 
structing textual material created by sociologists and other agents of the ruling 
system, based on the female sociologist’s own lived experience, will suffice 
to lead to new concepts and general (theoretical) understandings that reflect 
woman's standpoint. While she explicitly rejects a completely “subjectivist” 
sociology that avoids all abstraction, she does not explicate any method for 
moving from the realm of personal experience to a more abstract, systemic 
level of understanding, a level she presupposes by talking about patriarchy and 
capitalism. 

Collins gives somewhat more concrete advice by outlining three specific com- 
ponents of an Afrocentric feminist epistemology, in addition to the emphasis on 
beginning with experience. First, knowledge claims should arise from dialogue 
and stress connectedness between and active participation of researchers and 
their subjects, not from adversarial relations between knowers and the objec- 
tification of research subjects. Second, “personal experience, emotions, and 
empathy are central to the knowledge validation process" (p. 215); emotion 
is not separate from intellect (a point seconded by Smith). Third, Collins calls 
for an ethic of personal accountability among sociologists; knowledge claims 
should be evaluated in terms of what one knows about the “character” (values 
and ethics) of the knower (p. 218). Nonetheless, it remains unclear in Collins' 
work exactly how the Afrocentric feminist sociologist should move from the de- 
scriptive level of women's everyday experience to the more abstract, theoretical 
level she presupposes by talking about African-American women's oppression 
within a system that is patriarchal, racist, and classist. 

Ultimately, their rejection of abstraction as a masculine activity prevents 
Collins, and especially Smith, from proposing an epistemology that can in- 
form feminist conceptual and theoretical development, beyond the prescription 
that it must be thoroughly inductive. This shortcoming they share with non- 
feminist sociologists who share their critique of “positivism” in sociology and 
opt for thorough immersion in the world of everyday experience (e.g. most 
ethnomethodologists and some symbolic interactionists). However, wherever 
these “male” approaches are discussed in the sociological canon, Smith's and 
Collins's work deserve serious consideration as well. They add rich discussions 
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of the importance of the previously silenced standpoints of (African-American) 
women for understanding the taken-for-granted aspects of everyday life. In 
addition, theoretical discussions of ideology and the sociology of knowledge 
need to include their works, inasmuch as they clearly demonstrate the neces- 
sity of broadening concern from focusing on class to including gender and 
race/ethnicity in understanding the social-rootedness of thought systems and 
the intellectual roots of social power. 


The Issue of Essentialism 


Feminist standpoint theory, which is highly attuned to reification committed by 
mainstream sociologists, cannot avoid reifying the genders. While Smith and 
Collins explicitly recognize considerable variation among women (and pre- 
sumably men) in their experiences and consciousness, their own logics, and 
many times wording, make it clear that they assume that there are overarching, 
gender-specific standpoints; they could not otherwise talk about a “masculine” 
form of discourse. In addition, Collins explicitly cites such feminist theo- 
rists as Gilligan (1982) and Chodorow (1978, also 1974), who argue that the 
genders are fundamentally different in their moral reasoning and capacities 
for/commitments to interpersonal relationships. 

Positing dichotomous gender differences that are treated as transcultural and 
transhistorical is termed "essentialism," a view that has substantial currency 
among feminists in a variety of disciplines but is hotly contested in our own 
(e.g. Lorber et al 1981, Coser 1989, Epstein 1988). The empirical evidence for 
itis flawed, often based on small, nonrandom, American samples, and typically 
finds only modest differences, along with extensive overlap, between the sexes. 
Essentialist thinking converts differences of degree into differences of kind. The 
presumed but often unstated origin of essential differences includes psychody- 
namics rooted in the parental division of labor (Chodorow 1978) and biological 
sex (Rossi 1977, 1984). It has become common for feminist scholars to recog- 
nize within-gender categorical differences (e.g. race, class), but this awareness 
of difference has often failed to preclude essentialist thinking about basic per- 
sonality and value orientations (e.g. the assumption that, regardless of other 
differences, women are nurturant and oriented toward personal relationships, 
while men are individuating and oriented toward abstract moral principles). 
Given that the evidence suggests modest between-sex and considerable within- 
sex differences on virtually all individual-level traits, a dichotomous gender 
variable is theoretically useless when speaking of individual-level phenomena. 
Explanations that begin by categorically attributing different characteristics 
to women and men-—cognitive, emotional, relational, and/or behavioral—not 
only exaggerate differences in the distribution of such traits by gender, they also 
implicitly treat these variables as dichotomous rather than continuous. They 
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reflect the “either/or” thinking explicitly rejected by Collins, yet implicit in her 
logic. One can legitimately talk about average differences between females and 
males on individual-level variables, but care must be taken to avoid reification 
by delineating the average differences in experiences/opportunities/constraints 
that account for them and by explicitly recognizing the range of within-gender 
variation and between-gender overlap. Moreover, a dichotomous conceptualiza- 
tion of gender can be a theoretically meaningful aspect of social structure, such 
as when one talks about the degree of male-female occupational segregation or 
the extent to which an ideology devalues females relative to males. 


Conceptual Problems 


Treating differences of degree as differences of kind is also manifest in macro- 
level concepts employed by many feminist theorists, especially “patriarchy,” 
“exploitation,” and “oppression.” One rarely reads statements that contain vary- 
ing levels of these phenomena (e.g. society A is less patriarchal/oppressive/ 
exploitative than B). They are usually treated as constants, and the emphasis is 
placed on understanding the particular form of patriarchy/oppression/exploita- 
tion in a given time, place, and/or within a specific socioeconomic structure 
(usually capitalism). Yet the empirical literature clearly demonstrates consid- 
erable cross-societal variation on those dimensions that can be taken as indica- 
tors of the level (not simply form) of gender inequality (e.g. Martin & Voorhies 
1975, Sanday 1974, 1981, Blumberg 1978, 1984, Chafetz 1984). 

These terms are infrequently defined, and when they are, their definitions 
are often too broad, thereby obscuring the dynamics of gender stratification 
systems. Patriarchy, for example, has been used to refer to some combina- 
tion of the following: a type of family structure, an ideology (religious and/or 
secular), and one or more properties of the economy and/or polity. This kind 
of truth-asserting definition obscures questions of the extent to which and how 
these various phenomena are related to one another by assuming their empirical 
isomorphism. “Patriarchy” is often reified by the use of an active verb, as in 
“patriarchy causes/creates/requires ... .” When this happens, the explanatory 
content evaporates completely. Regardless of conceptual problems, in the re- 
mainder of this chapter I use the vocabularies employed by the theorists whose 
works are being reviewed. 


NEO-MARXIST THEORIES 


Marxist-inspired feminist theory, most of which today is called socialist-femin- 
ist, differs from orthodox Marxism (and orthodox Marxist feminism) by insist- 
ing that the nonwaged labor that maintains and reproduces workers, and is done 
overwhelmingly by women, is equally as important as waged labor, and that 


FEMINIST THEORY AND SOCIOLOGY 105 


oppression for women results equally from patriarchy and from class structure, 
not simply as a by-product of class relationships. It differs from other feminist 
theories by insisting that, while not sufficient to bring about the demise of patri- 
archy, the abolition of capitalism is a necessary condition, inasmuch as capital- 
ism derives numerous advantages, hence support from patriarchal institutions, 
ideology, and practices. While recognizing that patriarchy predated capital- 
ism, and scarcely disappeared in twentieth century socialist nations, Marxist- 
inspired feminists argue that within capitalist systems, patriarchy assumes 
unique forms that are interwoven with capitalism in mutually supportive ways. 

Within capitalist systems, the division of labor by gender makes women re- 
sponsible for the unwaged maintenance and reproduction of the current and 
future labor force, variously termed domestic work, production of use value, or 
necessary labor. Women may also be involved in what is variously termed pro- 
duction of exchange value, social labor, or surplus value through waged work, 
as are men. The nonwaged work done by women is crucial and profitable for 
capitalists, who get its benefits for free, and, therefore, such labor is exploitative 
and oppressive for women. Although earlier in the history of capitalism most 
women were denied the opportunity to become “social adults” through waged 
labor (Sacks 1974), more recently they have been sought by capitalists as a 
source of cheap labor in a highly gender-segregated labor market (Eisenstein 
1979). The gender inequities women experience in the labor market are linked 
both practically and ideologically to their responsibility for nonwaged domestic 
work (Hisenstein 1979, Vogel 1983, Shelton & Agger 1993). The dual exploita- 
tion of women within the household and in the labor market means that women 
produce far greater surplus value for capitalists than do men (Shelton & Agger 
1993). 

An ideology of patriarchy, or male supremacy, fostered by capitalists, un- 
dergirds and sustains both forms of female oppression. This ideology justifies 
women’s nonwaged domestic responsibilities with reference to biologically 
rooted reproductive differences between men and women and justifies gender- 
based labor market inequities with reference to women’s domestic obligations 
(Eisenstein 1979). In turn, working class men derive advantages both within the 
household (free domestic services and subservience from their wives, resulting 
from their economic dependence) and in the labor force (better paying jobs are 
reserved for men). No or low wages tie women to their better paid husbands in a 
subordinate position, and therefore to domestic labor, which in turn suppresses 
their wages (Hartmann 1984, Sacks 1974). Sacks (1974) argues that in this 
manner capitalists “compensate” men for their subordination to capitalist dom- 
ination, which impedes the development of class consciousness among workers, 
reinforcing capitalist domination (also Shelton & Agger 1993, Sokoloff 1980). 
Wives' economic dependence also ties men more securely to wage-earning jobs, 
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further serving the interests of capitalists by undermining potential rebellion 
against the system (Eisenstein 1979, Vogel 1983, Hartmann 1984). 

Feminist scholars have extended Marxist-based world systems theory (and its 
cousin, dependency theory) by demonstrating how capitalist penetration by core 
nations of peripheral ones usually reduces the status of women, thereby exacer- 
bating many problems (e.g. high fertility rates, poverty, and income inequality) 
in the peripheral nations (Ward 1984, 1990, 1993, Blumberg 1989). Ward 
(1993:48) criticizes world systems theory for assuming that women participate 
in the modern world economy only as members of households in which the 
male “head” is incorporated, thereby ignoring women’s direct role in the global 
economy and their economic contributions in the informal labor market as well 
as in the household. A massive research literature on women and development 
demonstrates the usually widely disparate effects of socioeconomic develop- 
ment for men and women, to the detriment of women. These findings are largely 
ignored by world systems theorists, who assume that household members have 
unitary interests (see also Blumberg 1988, 1989). As a corrective, Ward (1993) 
emphasizes the need to fully incorporate the crucial contributions women make 
to the food supplies and general economies of poor nations (see also Mies 1986). 
Earlier, Ward (1984) proposed that the specific, local effects on women’s work 
and status of Western capital penetration must be understood in terms of pre- 
existing patterns of “patriarchal relations,” including ideology and institutional- 
ized patterns of male dominance. In addition, the gender-based presuppositions 
of Western (male) capitalists affect the distribution of new resources and oppor- 
tunities between men and women in peripheral nations. Ward concluded that 
the level of foreign investment in peripheral nations and their trade dependency 
on core nations are positively related to the level of gender inequality. 

By extending Marxist analysis to include nonwaged maintenance and repro- 
ductive labor, and broadening the Marxian concept of ideology to encompass 
patriarchal thought, Marxist-inspired feminists demonstrate that gender is as 
central a component as class in understanding exploitation/oppression within 
capitalist systems and in understanding how capitalist systems are maintained 
and strengthened. These contributions merit serious attention in any scholarly 
discussion of neo-Marxist thought. 


CULTURAL AND SOCIAL MACROSTRUCTURAL 
THEORIES 


The macrostructural feminist theories that are not explicitly neo-Marxist divide 
mostly into two categories: those that emphasize the causal primacy of culture 
and ideology, and those that emphasize the centrality of socioeconomic fac- 
tors (for a review of both see Dunn et al 1993). Recently, Collins et al (1993) 
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attempted a grand synthesis of extant gender theories that emphasizes social 
structural constructs but includes elements from most other types of gender the- 
ory (see also Chafetz 1990, for a slightly less ambitious effort to do likewise). 
The social structural theories are sufficiently complex as to defy brief explica- 
tion, beyond a listing of central constructs. What macrostructural theories have 
in common is their goal: to explain variation in the level of gender stratification 
across time/space, and/or to explain how a given level is maintained and changed 
(see Chafetz 1984:4—7 for a conceptual definition of "gender stratification"). 

Virtually all feminist scholars agree that ideologies and related symbols and 
rituals that devalue wcmen and explain and justify different and unequal treat- 
ment by gender constitute an important component of gender stratification sys- 
tems. A few anthropologists make a cultural construct central in their explana- 
tions (e.g. Rosaldo 1974, Ortner 1974, and especially Sanday 1981, also 1974). 
Ortner's (1974) and Rosaldo's (1974) arguments are relatively simple: Because 
of women’s reproductive functions in birth and lactation, and the gendered di- 
vision of labor within the household and broader society that are typically 
constructed based on them, women become more identified with “nature” and 
domesticity, men with "culture" and the public sphere. In turn, culture and the 
public sphere are more highly valued socially and, therefore, the more strongly 
differentiated and segregated the two spheres, the greater the level of gender in- 
equality. In a more nuanced and fully developed theory, Sanday (1981) argues 
that each society has its own "sex-role plan" that delineates how relationships 
between men and women ought to be structured. These plans arise out of one of 
two overarching cultural orientations: an "inner," in which nature is sacred and 
the "female creative principle" is emphasized, and an “outer,” in which nature 
is seen as dangerous, humans are seen as superior to nature, and men's activities 
(as hunters and warriors) are revered. These orientations are grounded in the 
level of environmental threat and embodied in creation myths, which emphasize 
male, female, or both sources of power in the universe. Enhanced threat leads 
to an outer orientation, the primacy of male deities, and male dominance. The 
degree of gender inequality is thus a direct function of the type of sex-role plan 
as it reflects the general cultural orientation. 

Macrostructural social theories of gender stratification are typically sys- 
temic in nature, often include feedback loops, and emphasize as primary causal 
mechanisms one or more of the following: environmental, demographic, tech- 
nological, economic, and political variables. Intervening constructs include 
the gender division of labor, ideology, and family structure. Lenski's (1966) 
societal typology, based on dominant technology and the resulting level of 
economic surplus, constitutes the starting point for several theories (e.g. Huber 
1988, R Collins 1975, Chafetz 1984, Blumberg 1978). The extent to which 
the environment— physical and social—1s dangerous or threatening constitutes 
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another independent construct in some (e.g. Chafetz 1984, Sanday 1981, as 
discussed above, and especially Harris 1978, who emphasizes the role of war- 
fare). Demographic variables that are considered important include sex ratio 
(especially by Guttentag & Secord 1983, also Chafetz 1984, 1990), population 
density (Harris 1978, Chafetz 1984, 1990), and fertility rates (Huber 1991). 
The size of the economic surplus, contingent upon technological base, is re- 
lated to the level of gender stratification in a curvilinear fashion that peaks in 
agrarian/pastoral societies in the highest levels of inequality. The levels of fer- 
tility, population density, environmental harshness, warfare, and sex ratios are 
generally positively related to the level of gender inequality. One final inde- 
pendent construct concerns political structure, specifically, Collin’s (1975, also 
1972) thesis concerning the extent to which the political organization of the 
society (nation-state), rather than the household, monopolizes the legitimate 
use of force. He argues that (along with women’s level of economic opportu- 
nity) the extent to which the political structure grants individual men the right 
to physically coerce wives constitutes the most important independent variable 
explaining the level of gender stratification. While many feminist scholars have 
explored the role of male violence against women in producing or maintaining 
gender inequality, Collins is alone in making male coercive power central to 
such a theory by linking it to a typology of political structure. 

These independent constructs are typically linked to the level of gender in- 
equality primarily through their impact on three intervening constructs. Like 
the Marxist-inspired feminists, virtually all macrostructural feminist theories 
focus on the key role of the gender division of labor—within the economy and 
between the economic and domestic realms. The more equal the access of 
women to economic roles in the nondomestic sphere (especially where they 
control the products of, and/or income derived from their work), ihe lower the 
level of gender inequality, and the more responsibility women have for the 
domestic sphere, the less equal their opportunities in the economic realm (es- 
pecially Blumberg 1978, 1984, 1988, Collins et al 1993, Chafetz 1984, 1990). 
Besides the domestic division of labor, family structural variables of lineality 
and locality are also important intervening constructs (Martin & Voorhies 1975, 
Blumberg 1979, Chafetz 1984). Women fare worst where these two aspects 
of family structure favor the male side (patrilineage and patrilocality). Finally, 
like the Marxist-inspired feminists, but with less emphasis than that given by 
the cultural theorists, macrostructural social theories recognize the importance 
of religious and secular gender ideologies in buttressing systems of gender 
stratification (Blumberg 1978, 1984, 1988, Chafetz 1984, 1990). 

Macrostructural feminist theories rarely attempt to demonstrate the impact 
of gender stratification on other aspects of social structure (except as feedback 
loops). However, they do demonstrate that virtually all aspects of sociocultural 
structure in all types of societies are implicated in the gender system. Theories 
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concerning technology, work, ideology, family structure, political economy, 
demography, not to mention social inequality, that ignore the ubiquitous phe- 
nomenon of gender are radically incomplete, as are theories that attempt to 
explain social change and/or stability without reference to gender. Macrostruc- 
tural feminist theories provide important insights concerning the linkages be- 
tween gender stratification and other macrolevel structures and processes that 
should be incorporated into general structural theories. 


RATIONAL CHOICE AND EXCHANGE THEORIES 


Rational choice theory has been a target of criticism by several feminist schol- 
ars, most notably England (1989, 1993b, England & Kilbourne 1990, also 
Zelizer 1994). Feminist critiques of it are rooted substantially in an essentialist 
logic, inasmuch as they accuse rational choice theory of assuming a selfish, 
separative, and non-emotional actor who is masculine, thereby ignoring the 
connective, altruistic, and emotional motivations, claimed to be characteristi- 
cally feminine. This and related feminist criticisms of rational choice theory 
are discussed and rebutted by Friedman & Diem (1993). They also demonstrate 
the utility of this perspective for understanding gender inequality by examining 
the implicit rational choice analyses involved in several feminist studies. Their 
general point is that the studies they examine utilize "three mechanisms relied 
upon by rational-choice theorists to explain variation—institutional constraints, 
opportunity costs, and preferences . . ." (Friedman & Diem 1993:101). Rational 
choice thinking is also used in some recent discussions of family change in in- 
dustrial nations, which focus on changes in women's roles and decision-making 
processes concerning number of children, age of first marriage and birth, di- 
vorce, and labor force participation (e.g. Chafetz & Hagan 1996). Not only is 
rational choice theory useful to analyses of gender issues, feminist critiques of 
it focus attention on weaknesses and gaps that require further attention. Specifi- 
cally, the further development of this theory should include selfishness/altruism 
as a variable, consider the role of emotion, and explicitly include interpersonal 
preferences (Friedman & Diem 1993). 

Social exchange theory, which reflects the same utilitarian tradition as ratio- 
nal choice theory, has not been the target of explicit feminist criticism (for an 
exception, see Harstock 1985), although the same criticisms apply to both. It 
has been employed by a few feminist theorists (e.g. Parker & Parker 1979, Bell 
& Newby 1976, Chafetz 1980, Curtis 1986) and is implicit in many feminist 
empirical studies of husband-wife relationships. The general theme of this per- 
spective is that, given the traditionally greater resources available to men from 
sources outside the family, wives balance exchanges with their husbands by 
providing compliance and deference in return for financial support and access 
to other externally generated resources. Husbands also garner a considerable, 
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self-reinforcing power advantage over their wives because of what Curtis (1986) 
defines as a contractual inequality based on the husband’s provision of gifts and 
favors. These incur a debt for the wife which is unspecified, diffuse, and "can 
be infinite in effect” (p. 179.) However, as Parker & Parker (1979; also Chafetz 
1980) note, as the gender division of labor outside the family changes, men’s 
resource advantage and therefore the nature of spousal exchanges do as well. 
Unlike some uses of exchange theory, in the hands of feminist scholars the macro 
level environment, which shapes the distribution of resources and therefore the 
opportunities/constraints of exchange partners, is taken as the explicit starting 
point in understanding the nature of microlevel exchanges, which, in turn, are 
often analyzed in terms of their feedback impact on macrolevel phenomena. 
The use of exchange theory by feminists therefore exemplifies what Risman & 
Schwartz (1989) refer to as a microstructural approach to understanding gender 


inequality. 


NETWORK AND STATUS EXPECTATIONS THEORIES 


Although representing very different theoretical traditions, feminist versions 
of status expectations and network theories both focus on how interactive rela- 
tionships are shaped along structured gender lines and result in gender differ- 
entiation and inequality. Their logics are therefore similar to those discussed 
in the last section in that they also represent microstructural approaches to the 
study of gender inequality. 

The feminist theory most carefully developed in tandem with a systematic re- 
search program deals with the relationship between gender, status expectations, 
and power/influence in goal-oriented groups (see review chapter by Ridgeway 
1993, Ridgeway & Berger 1986, Meeker & Weitzel-O' Neill 1977, Lockheed 
1985, Foschi 1989). The theory has implications for same-sex groups but 1s 
most fully developed and tested on mixed-sex groups (Ridgeway 1993). The 
central thesis is that, given the higher social status that accrues to males, both 
women and men typically enter mixed-sex groups with gender-based expecta- 
tions that male members will be more competent than females in moving the 
group toward task achievement, i.e. "performance expectations" are higher for 
men. However, the salience of gender status is situationally induced and there- 
fore context-specific (e.g. performance expectations may advantage women if 
the task is traditionally considered feminine). In the absence of a set of speci- 
fied factors that reduce the salience of gender-based performance expectations, 
they become self-fulfilling prophecies that function to reduce women's self- 
confidence, prestige, power, and influence in group interactions. Moreover, 
because gender-based expectations are defined by group members as legiti- 
mate, individual women's attempts to counteract them will be rejected by other 
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group members as inappropriate (Meeker & Weitzel-O'Neill 1977). The out- 
comes of mixed-sex groups will therefore usually reflect the preferences of 
its male members. Moreover, the process of group interaction will typically 
enhance the status and power of the male members, which is often "the basis 
on which many of the society's rewards of power, position, and respect are 
distributed" (Ridgeway 1993:193), that is, the basis of gender stratification. 

Smith-Lovin & McPherson (1993:223) assert: "In one sense, itis impossible 
to have a network theory of gender" because the theory is concerned with the 
nature of relationships between actors, not actor characteristics. Nonetheless, 
in the only explicit theoretical discussion of gender differences and inequal- 
ity from a network theory perspective, these authors convincingly argue that 
gender-related characteristics, usually viewed as essential differences devel- 
oped through socialization, in fact result from the long-term impact of seem- 
ingly inconsequentially small differences in the network positions and structures 
in which boys and girls are typically located. Beginning with an analysis of 
single-sex childhood networks, they call upon a wealth of empirical literature to 
demonstrate how gender homophilous networks cumulate over the life course 
to create even greater gender differences in adult networks, which foster gender 
differences in aspirations, opportunities, and behaviors. Because they reflect 
ongoing network phenomena, these outcomes are amenable to change in re- 
sponse to changes in the nature of, and locations within, network structures 
for women and men, and therefore to public policy intervention. The authors 
also review some of the current inadequacies of the network literature for un- 
derstanding gender segregation and inequality, weaknesses that, if addressed, 
would strengthen the general theory. These include: a focus on “small, unrep- 
resentative populations" that are almost entirely single-sex; a focus on "elites" 
that are almost always male; a focus on "ego nets" that loses sight of the organi- 
zational context within which networks evolve; and a focus on one network in 
isolation from others in which actors are simultaneously involved (pp. 243-44). 

Together, feminist versions of rational choice, exchange, status expectations, 
and network theories emphasize the importance of sociocultural structure for 
understanding the gendered nature of interaction and individual choice, and the 
patterned gender differences and inequalities that result from such interactions 
and choices. They thereby contribute important insights into the general theo- 
retical issue of the nature of macro-micro linkages. 


SYMBOLIC INTERACTIONIST AND 
ETHNOMETHODOLOGICAL THEORIES 


Feminist versions of symbolic interaction theory and ethnomethodology focus 
on gender as an ongoing accomplishment that emerges during interaction 
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processes, both between and within the sexes. This perspective is succinctly 
captured in West & Zimmerman’s term (1987) "doing gender,” which refers to 
the work done during interactions in order to constantly recreate the partners’ 
sense of their own and the other’s gender (also West & Fenstermaker 1993, 
Fenstermaker et al 1991, Fenstermaker Berk 1985, Goffman 1977). Gender 
is an “emergent feature of social situations” (West & Fenstermaker 1993:151), 
not a static feature of structure or set of individual-level traits. 

Gender is “omnirelevant” in that any action can be interpreted as exemplify- 
ing it (West & Fenstermaker 1993). Given the taken-for-granted view that there 
are two and only two sexes, and everyone belongs in (only) one of them, people 
characterize self and others by sex ("gender attribution") and then interpret and 
respond to virtually any kind of behavior according to its normative gender “‘ap- 
propriateness.” The nature of masculinity and femininity varies, but the notion 
that men and women are fundamentally different does not. People are constantly 
creating the sense of gender difference and defining self and others through that 
lens (Kessler & McKenna 1978, Goffman 1977). West & Fenstermaker (1993: 
157) assert: “persons engaged in virtually any activity can hold themselves 
accountable and be held accountable for their performance ... as women or as 
men...” and will be legitimated or discredited accordingly. A major corollary 
is that, while the specific relevance of gender is always contingent upon the 
interaction context in which behavior occurs (Fenstermaker et al 1991), it is no 
less relevant to single-sex than to cross-sex interactions (Gerson 1985). 

“Doing gender” not only (re)produces gender difference, it (re)produces gen- 
der inequality. One very important medium through which gender-construction 
Work occurs is conversation. Numerous analyses of male-female conservation 
and language usage have been conducted (e.g. Fishman 1982, Mayo & Henley 
1981, McConnell-Ginet 1978, West & Zimmerman 1977, Lakoff 1975). They 
conclude that conversation between men and women reinforces gender inequal- 
ity, primarily because "the definition of what is appropriate conversation be- 
comes men’s choice. What part of the world [they] . . . maintain the reality of, 
is his choice ...” (Fishman 1982:178). Men dominate conversations; women 
work hard to keep them going; women use verbal and body language in ways 
that weaken their ability to assert themselves and, therefore, reduce their power 
(McConnell-Ginet 1978, West & Zimmerman 1978, Lakoff 1975, Mayo & 
Henley 1981). 

Another major mechanism by which gender inequality is reproduced through 
interaction is scripting (West & Fenstermaker 1993). The social scripts for many 
tasks are specifically associated with gender, and people "do gender’ as part 
and parcel of doing them. Fenstermaker Berk (1985, also DeVault 1991) shows 
how the division of household labor, which numerous studies demonstrate is 
highly inequitable, provides the opportunity for both spouses to "do gender" 
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and reinforce their own and their partner’s gendered identities. Hochschild 
(1983) develops the concept of “emotional labor,” which refers to the need to 
hide or fake one’s feelings 1n order to please others, in discussing the gendered 
scripts associated with many traditionally female jobs. Kasper (1986) expands 
upon this concept, seeing it as integral to the scripts for female behavior in a 
variety of interaction contexts and as runctioning to deny women an "integrated 
autonomous identity of their own" (p. 40). In turn, this impedes women's ability 
to achieve in the public sphere. 

Schur (1984) uses an offshoot of symbolic interactionism, labeling theory, 
to demonstrate that femaleness constitutes a devalued and stigmatized master 
status that results in women being selectively perceived and reacted to primarily 
1n terms of stereotypes about femaleness (p. 25). This leads to objectification of 
women, or their treatment as things rather than as persons, which allows others 
to treat them in degrading and exploitative ways. The result is a self-fulfilling 
prophecy, whereby women come to see themselves as inferior and to suffer 
from low self-esteem, passivity, in-group hostility, and identification with their 
(male) oppressors. 

In an application of George Herbert Mead's concepts, Ferguson (1980) 
argues that men possess the power to define both specific situations and the 
generalized other. Women, therefore, “are defining themselves by reference 
to standards that brand them as inferior" (p. 155), thereby undermining their 
self-identity and producing self-blame for their problems. In addition, power- 
lessness forces women to become highly adept at taking the role of the (male) 
other, anticipating male wants in order to avoid negative sanctions; it prompts 
women to please, flatter, and acquiesce to men for the same reason (pp. 161—62). 
The result is that male power is buttressed. 

Feminist versions of ethnomethodology and symbolic interactionism focus 
on the microlevel processes by which gender differences and inequality are 
constantly (re)created in everyday life. By demonstrating that both cross- and 
same-sex interactions normally entail "doing gender," they suggest that gender 
is a fundamental feature and outcome pf all interaction, one that should comprise 
a central component of general interaction theories. 


NEO-FREUDIAN AND ROLE THEORIES 


No thinkers were more thoroughly criticized by feminists during the 1960s and 
1970s than Parsons and Freud. Nonetheless, the new specialty developed by 
feminist sociologists called itself by the Parsonian term, the sociology of sex 
roles; by the late 1970s, one ofthe most influential feminist theories in sociology 
was Chodorow's (1978, also 1974) neo-Freudian account of gender differentia- 
tion and inequality. The term “sex (gender) role" has since been abandoned by 
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feminist scholars because it obscures power inequities, thereby depoliticizing 
gender (Stacey & Thorne 1985), and because it fails to articulate situational 
variation in role enactment (Lopata & Thorne 1978, West & Fenstermaker 
1993:154—55). However, role analysis remains an important part of feminist 
theorizing because many specific social roles are entirely (e.g. wife/husband) 
or largely (e.g. numerous occupations) played by members of only one sex. 
Feminist Freudian theory continues to be developed (see reviews by Kurzweil 
1989 and by Williams 1993; also Chodorow 1989). 

The earlier sex/gender role perspective focused primarily on delineating the 
processes of childhood socialization (especially modeling, positive and nega- 
tive sanctions) through which, beginning at birth, boys and girls are taught by 
parents, and later peers, schools, media, etc, "sex appropriate" gender identities 
(said to be all but immutable by about age 3) and gender normative behaviors 
(which are presumably trans-situational and therefore applicable in all interac- 
tions) (e.g. Cahill 1983, Lever 1976, Constantinople 1979, Lewis & Weinraub 
1979, Coser 1986, 1975, Sattel 1976). This perspective is primarily rooted in 
cognitive development and symbolic interaction theories. Besides the problems 
mentioned above, the sex/gender role perspective also makes it all but umpossi- 
ble to explain gender-related changes at the individual or collective level. To the 
extent that childhood engenderment strongly shapes all subsequent behavior, it 
is difficult to explain how adults could change, and therefore, how new gener- 
ations of children could be taught different gender conceptions. Katz (1979) 
introduces a life-cycle perspective to issues of gender socialization, an approach 
that can better accommodate change. This approach is elaborated by Lopata 
(1994), who examines specific social roles associated with women, rather than 
general sex/gender roles. She focuses on how the major social roles played by 
women (especially wife, mother, relative, homemaker, and employee) change 
over the life course and how and why they have changed with societal modern- 
ization, thereby reducing the hypothesized impact of early childhood learning. 
Likewise, Johnson (1989, 1993) revisits Parsons to show how one can revise his 
evolutionary theory and analysis of family roles usefully to account for recent 
changes in women's roles. 

Like most socialization theories of engenderment, feminist neo-Freudian the- 
ory argues that, at a very early age, the two sexes develop gender identities and 
gender differentiated personalities that are highly stable over the life course. 
The two theory types differ in the processes by which this presumably occurs. 
The best known neo-Freudian feminist scholars are French, but the one who has 
most influenced feminist sociologists in the United States is Nancy Chodorow 
(especially 1978). Incorporating object relations theory into her revisions of 
Freudian thought, Chodorow argues that, because early childrearing is over- 
whelmingly a female task, children of both sexes have a woman as their primary 
love object. However, boys' and girls' Oedipal stage experiences and outcomes 
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are vastly different because only girls share the sex of their primary love object. 
Because girls need not separate form their mothers to attain a gendered iden- 
tity, they grow into women whose primary concern is with connection to other 
people. Given a different-sex primary love object, boys develop a gendered 
identity through separation, resulting in men who focus on individuation and a 
denial of affect. The gender-specific psychological orientations that result from 
the fact that women mother children of both sexes underpin male misogyny and 
dominance. Gilligan (1982) uses Chodorow’s theory to refute Kohlberg's levels 
of moral reasoning. She argues that women’s morality is different from (not 
at a lower level than) men’s because it is based on personal relationships and 
obligations rather than abstract principles (which Kohlberg privileges). These 
two works are widely cited by feminist sociologists, despite extensive critique 
of their essentialist logic, psychological reductionism, and other problems (see 
Williams 1993 for a review of those critiques, and see Lorber et al 1981). 
Like socialization explanations, feminist neo-Freudian theory makes gender- 
related changes all but impossible to explain. A different kind of criticism of 
Chodorow’s theory is developed by Johnson (1988), who concludes that it is 
fathers, not mothers, who reproduce gender differentiation in children and gen- 
der inequality among adults. Children of both sexes become "human" through 
interactions with their primary love object, a mother figure, who tends to min- 
imize gender difference. Fathers differentiate their children much more on the 
basis of gender. In addition, children observe their mothers playing the wife 
role, which models gender inequality in relationship to their husbands. 


CONCLUSION 


The Newest Trend in Feminist Theorizing 


The “hot topic" in the 1990s among feminist scholars is "the intersection of race, 
class and gender.” Edited books (e.g. Anderson & Collins 1995a, Rothenberg 
1992), special journal issues, program sessions, and a new section of ASA 
have been devoted to 1t. The central contention of this emerging focus is that 
the three forms of oppression are not separate and additive, but interactive 
.and multiplicative in their effects. However, to date, very little theory has 
been produced on the topic; the growing literature remains overwhelmingly 
descriptive, and too often descriptive of a sample of women of only one race 
and class (or even specific occupation). 

One exception is Collins (1990), who suggests several interesting ideas about 
how to theorize "one overarching structure of domination" that includes age, re- 
ligion, and sexual orientation in addition to race, class, and gender (p. 222). She 
argues that people can simultaneously be oppressed and oppressor, privileged 
and penalized; that no one form of oppression is primary, although individuals 
and groups often define one as more fundamental and others as lesser; and that 
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the matrix of domination has several layers (e.g. persons, group or community 
culture, social institutions), all of which are sites of potential resistance to dom- 
ination. Moreover, different systems of oppression may rely on varying degrees 
of systemic versus interpersonal mechanisms of domination (pp. 226-27). 

West & Fenstermaker (1995) reject the mathematical metaphor involved in 
Collins’s (also Almquist 1989, Glenn 1985, Anderson & Collins 1995b) idea 
of intersecting systems of inequality. They point out that no one can expe- 
rience gender without simultaneously experiencing their other statuses, so all 
outcomes are simultaneously “gendered,” “raced,” and “classed.” Using an eth- 
nomethodological approach, they argue that these are all ongoing accomplish- 
ments whose relevance cannot be determined apart from the context in which 
they are accomplished. Collins et al (1995) respond that West & Fenstermaker 
reduce oppression to difference and lose sight of the structural inequities that 
are fundamental to these statuses. 

These efforts constitute the barest beginnings of theorizing about how various 
forms of inequality relate to one another. The recently developing queer theory 
(e.g. see Seidman et al 1994), which deals with the social construction of 
sexual identity and preference labels, especially those that are “marginal,” is 
also interwoven with feminist theories. It is, however, beyond the scope of this 
chapter to discuss the theory. Both of these new issues reflect the perception 
by many feminists that prior theoretical efforts have been too middle class, 
white, and heterosexist, and that feminist theory must recognize diversity among 
women and therefore account for the multiplicity of forms of oppression, not just 
that experienced by otherwise privileged white women. Theoretical progress 
on the topic of how various systems of inequality interact could revolutionize 
the sociological understanding of social stratification, which, for too long, has 
theorized narrowly about inequality in terms of social class/status. 


Feminist Theory and Mainstream Sociology 


Sociologists have always assumed that, as Lorber states (1994:36), "For hu- 
mans, the social is the natural" However, until recently sociologists exempted 
gender from this assumption and largely ignored the topic. The most fundamen- 
tal contributions of feminist theories have been to demonstrate the thoroughly 
sociocultural nature of all aspects of the gender system and the omnirelevance of 
gender to social life. This corpus of work demonstrates the daily "hard work," 
conducted at the micro- and macrolevels by individuals and social collectivi- 
ties, that goes into (re)producing gender as a fundamental feature of social life, 
indeed, a more ubiquitous feature than social class. 

Feminist theorists have used virtually all theoretical traditions in sociology 
as springboards to understand the gendered nature of social life. In the process, 
they have offered rich and important critiques of the inadequacies of traditional 
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theories that have resulted from the masculine blinders their authors have worn. 
They have developed revisions of those traditions that broaden and deepen the 
discipline’s understanding of social life. Gradually, albeit too slowly, these 
perspectives are becoming incorporated into the mainstream theory canon. It 
is my hope that this review may hasten that process. 
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ABSTRACT 

The deteriorating economic well-being of children portends less well-adjusted 
adults and a diminished economic future for America. A disproportionate share of 
today's poor children will become tomorrow’s poor adults. This chapter discusses 
the concept, definition, and measurement of children's economic well-being and 
poverty. Children's current economic well-being is evaluated in comparative 
perspective—international, historical, and demographic. The chapter also eval- 
uates the etiology of changes in children's absolute and relative economic well- 
being, focusing especially on the role of the changing family, parental employ- 
ment, and levels of social provision for poor families. These "causes" are then 
evaluated in the context of recent public policy debates, including the devolution 
of federal welfare programs to the states. 


INTRODUCTION 


A widespread and justified perception is that an increasing share of American 
children are “at risk," both materially and psychoemotionally. By current stan- 
dards, children suffer disproportionately high rates of measured poverty, and 
they are often the innocent victims of rapid changes in our most fundamen- 
tal institutions—the family, school, and government. The breakdown of the 
traditional nuclear family has meant that an increasing share of children live 
with an unmarried, typically poor mother (Bianchi 1990, Hernandez 1993). 
Or, if they live with two parents, they receive their primary child care from 
nonfamily members or formal providers because both parents must work, of- 
ten at low wages, to ensure a living family income. The schools have been 
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unable to compensate for the social and economic deficits in the home (Booth 
& Dunn 1995). In fact, they are usually blamed for exacerbating the problems 
of poor children while promoting the intergenerational transmission of poverty 
(Corcoran 1995). Government often seems unable or unwilling to respond. For 
example, current debates about welfare reform—time limits and family caps— 
often come down to whether children in a rich society should bear the costs of 
current national and global macroeconomic shifts or whether they should suffer 
needlessly from the decisions of their parents regarding work and family life 
(Corbett 1993). 

The purpose of this review is to place the changing economic circumstances 
of American children in proper perspective—a comparative one. The objective 
is to define and evaluate current absolute and relative definitions of children’s 
changing economic deprivation and poverty. A comparative stance is neces- 
sary for accurately gauging the economic well-being of children across societies 
and for different subgroups of the population (e.g. racial and ethnic minority 
children). It is also required if we are to effectively monitor the changing 
well-being of children and measure the economic impact of public policy ini- 
tiatives, including nascent state welfare reform legislation. This paper sets 
current policy debates within a broader historical or international perspective 
and identifies those factors or circumstances—such as the rise in female-headed 
families and declining real wages among less skilled young adults—that have 
contributed to the currently high levels of relative economic deprivation among 
children. Finally, this review stresses the increasingly important but often ne- 
glected topic of growing inequality in the economic circumstances of America’s 
children. 

To be sure, evidence of increasing poverty rates among children and for di- 
verse segments of the population constitutes a strong basis for public concern. 
The short- and long-term deleterious consequences of poverty for children and 
for society are large and well documented (McLeod & Shanahan 1993, 1996, 
Hao 1995). Childhood poverty impairs physical growth, cognitive develop- 
ment (e.g. reading ability), and socioemotional functioning (e.g. behavioral 
problems, depression) (Hill & Sandfort 1995, Korenman et al 1995). Poor 
children are also more likely to be physically abused; indeed, effective par- 
enting suffers in economically stressful environments (Conger & Elder 1994, 
McLanahan & Sandefur 1994, Kruttschnitt et al 1994). The incidence, dura- 
tion, and chronicity of childhood poverty also have large negative effects on 
children’s IQ, educational achievement, and later adult productivity (as mea- 
sured by wage rates and hours worked), while increasing their adult welfare 
dependency (Zill 1993, Duncan et al 1994, McLanahan & Sandefur 1994). 
The implication is obvious: The effects of high rates of economic deprivation 
among today’s children may only be fully realized by tomorrow’s adults. 


Number In Poverty 
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MEASURING ECONOMIC WELL-BEING AND POVERTY 


Most discussions of children’s changing economic circumstances refer to the 
recent rise in children’s poverty rates as officially measured by the US Census 
Bureau. Published in their Current Population Reports (P60 series), these rates 
are used by governmental agencies to establish eligibility criteria for various 
social insurance programs and public assistance. The official poverty income 
thresholds were set in the early 1960s at three times the income necessary to 
maintain an economy food plan devised by the US Department of Agriculture 
for families of various sizes (Orshansky 1965). With only slight subsequent 
modifications, the poverty income thresholds are adjusted each year for inflation 
using the Consumer Price Index (CPI). In 1994, a before-taxes income threshold 
of $15,029 was set for a family of four (US Bureau of the Census 1996). 

Based on the Orshansky criteria, tne trend in child poverty has been one of 
substantial reductions during the 1960s, which leveled off during the 1970s, and 
then increased in the 1980s and early 1990s (Figure 1). In 1994, 15.3 million or 
21.8 percent of all American children lived in poor families. Although children 
comprised only 26.7 percent of the US population, they accounted for 40.1 
percent of all poor persons (US Bureau of the Census 1996). 
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This income-based measure of the prevalence of economic deprivation has 
many well-known limitations that confound policy discussions and have the 
potential to distort group comparisons in America’s increasingly diverse popu- 
lation (Haveman et al 1988, Ruggles 1990, Citro & Michael 1995). The measure 
is adjusted for inflation each year, but not for the increases in real family in- 
come and consumption by children. It does not take into account near-income 
(e.g. food stamps) or in-kind public assistance (e.g. subsidized school lunches) 
for children. It does not correct for geographic differences in the costs of 
feeding, clothing, and sheltering children. It does not adjust for income that 
represents the costs of working (e.g. costs of childcare) or that ts not available 
for the purchase of material goods or services (e.g. child support payments paid 
by the noncustodial parent at the expense of coresidential children), or that 
trades in the informal economy (e.g. bartering goods or services). Income is 
not adjusted for taxes, which have varied enormously across political units and 
over time. The various poverty thresholds do not adequately reflect economies 
of scale or measure “equivalent” poverty income for various family sizes and 
the adult-children composition. The measure also does not take into account 
the increasing share of children reared by single parents and their cohabiting 
partners, partners whose income often is excluded from calculations of official 
child poverty (Manning & Lichter 1996). Finally, despite rapid changes in the 
family, this measure still implicitly assumes that parental resources are invested 
in children—biological, step, noncustodial—in an equitable or altruistic way 
(i.e. equally or according to need). 

With so many well-known limitations, how can we be certain that economic 
deprivation—as measured by the official poverty rate—has really increased 
by one third over the last two decades among America's children? Some re- 
searchers suggest that poverty should consider actual consumption levels rather 
than income alone (Mayer & Jencks 1995). Consumption-based measures typ- 
ically show lower rates of poverty than do measures based on pre-tax income 
(Slesnick 1993, Lino 1996). The income generated in the household frequently 
is underreported in survey research (Edin 1995, Mergenbagen 1996), while 
household consumption is higher than expected on the basis of reported in- 
come. For 1988-1990, low-income families consumed goods and services 
worth more than twice their annual incomes (Mayer 1996). 

Poor families and children also have more material possessions than they 
did a generation ago (Federman et al 1996). A smaller percentage than in the 
past live in substandard housing (i.e. without indoor plumbing or electricity). 
A larger percentage eat better (as measured by caloric intake)—a testimony 
perhaps to the food stamp program and the comparatively cheap food supply in 
this country. And they have greater access to material goods that were simply 
unavailable in the past (1.e. TV, air conditioners, etc). For example, 54.696 of 
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persons in families receiving AFDC own a VCR (US Bureau of the Census 
1995). 

Such information, however, conveys little about the quality or adequacy of 
these material possessions or their relevance to children's well-being. More- 
over, a recent study (Federman et al 1996) shows that 55.196 of poor families 
experienced at least one of the following deprivations during the past year 
(i.e. in 1992): evicted from home; utilities disconnected; telephone discon- 
nected; housing with upkeep problems; crowded housing; no refrigerator; no 
stove; or no telephone. These same deprivations were experienced by only 1396 
of nonpoor families. The issue today arguably is less one of growing absolute 
material deprivation—that is still an empirical question depending on whether ` 
cash income or consumption is the relevant indicator—but rather whether child 
poverty or deprivation is morally defensible in an affluent society. 

Growing relative economic deprivation is also a concern, as children in poor 
families have fallen behind other population segments. As described in the 
following section, child poverty in America is different from poverty in other 
rich countries. It is different from the recent and distant past on many dimen- 
sions (e.g. reliance on welfare income). And it is increasingly heterogeneous 
in magnitude, etiology, and consequences across different demographic seg- 
ments (e.g. minority children, immigrants, children living with a single parent) 
of American society. Our preoccupation with absolute deprivation should not 
distract us from these relative dimensions of the family resources available to 
America's children (Duncan 1992, Lichter & Bggebeen 1993, Fischer et al 
1996). 


AMERICAN CHILDREN IN COMPARATIVE 
PERSPECTIVE 


Child Poverty Compared to Other Western Industrial 

Countries 

Whether child poverty rates in the United States are regarded as relatively high 
or low depends on one's standard of comparison. Rainwater (1995) compared 
poverty rates of families with children in the mid-1980s in the United States with 
seven other Western industrialized countries that spanned the range of social 
service provision. Using the Luxembourg Income Study database, he exam- 
ined poverty in market-oriented economies like Canada and in social democratic 
welfare states like Sweden and the Netherlands. The United States by inter- 
national standards had very high rates of child poverty (20.396 ), especially in 
comparison with Germany (4.9% ), Sweden (3.5% ), and the Netherlands (4%). 
The differences were even more striking among female-headed families with 
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children. The rate for the United States was 57.9% compared with 19.7% in 
France, 16.5% in the United Kingdom, and 7.9% ın the Netherlands. 

These rates were calculated as one half the median family income (adjusted 
for age of the head and family size). But, because real family income is higher 
in the United States than elsewhere, these rates may not be strictly compara- 
ble. One half of the median family income may have more purchasing power 
in the United States than elsewhere. Other research, however, has calibrated 
“equivalent” income poverty thresholds in other countries to allow international 
comparisons. For example, Smeeding & Torrey (1988) converted the US ın- 
come poverty line into the currencies of other countries. The official poverty 
threshold in the United States is roughly 42% of the median family income and 
1s equivalent to slightly more than 50% of median family income in Sweden. 
Even with poverty comparisons based on equivalent currencies, however, US 
children compared unfavorably with children in most other Western industria- 
lized societies considered. 

There are many underlying reasons for the comparatively high poverty rates 
among American children. Rainwater (1995) argued, for example, that the 
difference resides primarily in the low ameliorative effects of public transfer 
income on poverty, especially for female-headed families with children. For 
these families, countries with both low earnings and low transfer incomes rela- 
tive to the median equivalent income had the highest rates of child poverty. The 
United States ranked lowest when both factors were considered together, and 1n 
fact had the highest poverty rate among single-mother families with children. 

Smeeding & Torrey (1988) showed that poor children in the United States 
received most of their transfer income from means-tested welfare programs, 
while European countries relied more heavily on universalist social insurance 
programs such as child allowances that all families with children received. 
Indeed, that pre-tax and pre-benefit poverty was actually higher on average in 
Europe than in the United States (Smeeding & Rainwater 1991, cited in Rodgers 
1996). The limited safety net in the United States makes the adverse income 
effects of family disruption and unmarried childbearing especially large for 
women and coresidential children (Smock 1993, Casper et al 1994). 

Others argue that the high rates of poverty among women (and their children) 
in the United States reflect its demographic heterogeneity, especially with regard 
to family structure (Casper et al 1994). But this explanation alone is incomplete. 
Using the Luxembourg Income Study, Smeeding et al (1988) showed that the 
percentage of poor children in a country was unrelated to the proportion of 
the families headed by a single parent. Sweden, for example, had a higher 
percentage of their children borne to unmarried women than in the United 
States. But the poverty rate of families headed by unmarried women in Sweden 
was only 6.596 . For single parent families without a wage earner, the poverty 
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rates were 18.2 and 95.7%, respectively, in Sweden and the United States. 
These are extraordinarily large differences by any standard. 


Child Poverty Compared to the Past 


Child poverty is different today from poverty in the past. To be sure, the recent 
rise in income-based poverty among children, despite the limitations of such 
measures, has been an important catalyst for legislative advocacy on behalf of 
children. But absolute increases in cnild poverty are arguably less important 
than several other relative dimensions of the current poverty problem. The rise 
in child poverty has occurred simultaneously with rising age inequality, real de- 
clines in income among poor children, a growing gap between rich and poor chil- 
dren, increasing dependence on welfare income, a changing family and spatial 
ecology of poverty, and the growing incidence of chronic or persistent poverty. 

Children today are at once the poorest age segment in American society 
and the most vulnerable and innocent. Children have poverty rates that are 
nearly twice those of the elderly population, a situation without precedent in 
American history (Figure 2, see also Fuchs 1991). The rise in child poverty 
since 1980 occurred as poverty of the elderly declined substantially, largely 
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Figure 2 Ratio of children in poverty to elderly in poverty: 1959-1994. 
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through government social insurance, including social security and Medicare 
(Preston 1984). Concerns today about child poverty cannot be separated from 
the issue of growing generational inequities and governmental investments that 
favor the elderly over children (Fuchs & Reklis 1992). 

Poor children today are absolutely poorer than the poor children of a gen- 
eration ago. Duncan (1992) showed that the median real income of the poor- 
est families with children declined by 12% between 1970 and 1986 (see also 
Haveman et al 1988, Karoly 1994). Eggebeen & Lichter (1991) reported that 
the proportion of children in “deep poverty,” i.e. below 50% of the poverty 
threshold, increased by one third between 1970 and 1988. Moreover, the me- 
dian family income-to—poverty ratio among children in the bottom 20% of the 
family income distribution declined absolutely from .85 in 1969 to .68 in 1989 
(Lichter & Eggebeen 1993). The poverty gap—the income required to lift poor 
children out of poverty—has grown substantially over the past decade. 

Poor children today also are poorer in comparison to affluent children; i.e. the 
relative income gap between poor children and the "average" or "affluent" child 
increased sharply over the past decade or so. The percentage of children living 
in families below one half the median family income increased from 21.496 
to 27.296 between 1970 and 1988 (Eggebeen & Lichter 1991). In 1969, the 
income-to—poverty ratio among the most affluent 20% of American children was 
5.82 times greater than the figure for the poorest 20% . It was 8.72 times greater 
in 1989, because family income both declined among the poorest children and 
increased (by nearly 2096) among the richest children (Lichter & Eggebeen 
1993). 

The income mix of families of poor children today is different from two 
decades ago; they are now more likely to rely on welfare income and on other 
in-kind public assistance programs (Jensen et al 1993, Jensen & Eggebeen 
1994). Nearly 10 million US children received AFDC in the early 1990s, a level 
without precedent, and over 13.5 million poor children lived in households re- 
ceiving some form of means-tested assistance (US Bureau of the Census 1996). 
In 1989, 47.296 of poor children received cash assistance from government 
programs, compared with 24.6% in 1969. More important, the percentage of 
children's family income that is from welfare increased from 18.296 to 35.596 
over 1969-1989 (Jensen et al 1993). One quarter of all poor children in 1988 
were fully welfare-dependent (i.e. without any other source of income) com- 
pared with about 6% in 1959 (Hernandez 1993). This occurred at the same time 
when average welfare benefit levels for children declined and the antipoverty 
effectiveness of cash and near-cash transfers eroded (US Department of Health 
and Human Services 1996). 

Poverty rates increased most dramatically among the children of work- 
ing parents, rather than among the children of single parents (Annie Casey 
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Foundation 1996). The rising share of family income from welfare (noted above) 
is due primarily to significant declines in earnings from work among the poor. 
Indeed, a larger share of single mothers work today than in the past, and they 
have higher labor force participation rates than married mothers (Bianchi 1990, 
Hernandez 1993). The paradox is that work-based welfare solutions are now 
viewed as the panacea for poor children at the same time that the ameliorative 
effects of work have eroded, especially for women and the low skilled (Lichter 
et al 1994). 

Poor children today are arguably more likely than in the past to be socially 
and spatially isolated from the nonpoor in schools, neighborhoods, and com- 
munities, in both urban and rural areas (Massey & Denton 1993, Garrett et al 
1994, O'Hare 1995). In 1990, 54.69€ of poor children lived in neighborhoods 
with rates of poverty exceeding 2096 (Jargowski 1996). Moreover, for the 100 
largest metropolitan areas, a larger proportion of America's poor children lived 
in poor census tracts (defined as 4096 or more in poverty) in 1990 than in 1980 
(Johnston 1997). A growing policy concern is whether the increasing geo- 
graphic concentration of poverty in low-income urban neighborhoods or rural 
areas promotes maladaptive behavioral or cultural adjustments— juvenile delin- 
quency, school dropout, and nonmarital childbearing—that perpetuate poverty 
into adulthood (Lichter 1993). 

The family ecology of poor children is different today from that in the past. In 
the mid-1960s, about two thirds of all poor children lived with married-couple 
families or families with a male present (Rodgers 1996). Today, 8.5 million 
children (57% of poor children) live with a single parent. This a product of 
the acceleration of divorce rates during the 1970s and the continuing high rates 
of marital disruption today (Schoen & Weinick 1993). The poverty-producing 
effects of divorce are reinforced by out-of-wedlock childbearing, and nearly 
one third of all births today occur outside of marriage; thus about 50% of all 
children can expect ta live some part of childhood in a single-parent family 
1f current trends continue (Ventura et al 1995). The new freedom to make 
previously stigmatized choices about marriage and family life (e.g. divorce or 
cohabitation) may benefit adults but be harmful economically to their children 
(Bumpass 1990). 

Poor children today are more likely than in the past to be chronically poor, 
1.e. suffer persistent bouts of poverty or recurrent poverty (Duncan & Rodgers 
1989, Duncan & Rodgers 1991, Devine et al 1992). For 1992-1993, 8.3% 
of children were poor all 24 months, compared with 4.8% of all persons (US 
Bureau of the Census 1995). The average duration of poverty among female- 
headed families with children (during 1991-1993) was 3.9 months, compared 
with 6.9 months among married couple families with children (US Bureau 
of the Census, 1995). Long-term poverty implies policy solutions—training, 
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remedial education, job growth—that differ from the short-term relief (e.g. food 
stamps) required at a time of need (Duncan 1992). 

In sum, our current preoccupation with rising absolute poverty rates should 
not divert us from perhaps a more fundamental concern, i.e. that poverty for 
children is much different today than in the past along several dimensions. 
Today's poverty among children must be judged against the living conditions 
and consumption levels of society as a whole and other advantaged groups— 
current and past. It is with regard to this relative dimension of child poverty 
that the political rhetoric matches the reality. It also is this relative dimension 
that implies increasing social and cultural differentiation in the future as the 
current generation of poor children enters adulthood. 


Child Poverty Compared Across Diverse Population Groups 


Poverty and prosperity are experienced differently across diverse segments of 
America society (see Table 1). Most evidence suggests that economic inequality 
has grown over time, both within and between the rich and poor, more educated 
and less educated, blacks and whites, married-couple families and single-parent 
families, native-borns and immigrants, city dwellers and suburbanites (Danziger 
& Gottschalk 1993, Levy 1995, Karoly 1994). We have at the same time become 
more heterogenous as a society along these same demographic dimensions. An 
important research question today is not whether differences exist—they surely 
do (Table 1)—but whether conventional poverty measures accurately portray 
group differences, and whether the underlying etiology of poverty for different 


Table 1 Poverty and prosperity among US children 1994 


,  Povety =  Prospenty? — 
Number* Rate Number? Rate 
Total persons 38,059 14 5 171,818 65.7 
Total elderly 3,663 117 18,386 58.8 
Total children 15,289 218 39,248 561 
Total white children 9,346 169 33,949 61.5 


in married-couple families 4,268 99 30,305 701 
in female-headed families 4,099 45.7 2,517 28.1 
Total black children 4,906 43.8 3,537 316 
in married-couple families 617 14.6 2,468 582 
in female-headed families 3,935 632 909 14.6 
Total Latino children 4,075 41.5 2,146 28.0 
1n married-couple familtes 1,952 301 2,340 361 
in female-headed families 1,804 68.3 270 102 


(Source. Table 9, U S. Bureau of the Census 1996) 
"Number in 1000s 
P Above two times the poverty income threshold 
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groups can be identified and appropriately addressed. The former is important 
for identifying groups most "at risk," while the latter question is important if 
reducing inequality is a social goal. 

One criticism of the current official poverty measure is that it may not accu- 
rately reflect group differences in available family resources over time (Citro 
& Michael 1995). Perhaps the best example is the substantial and persistent 
heterogeneity in child poverty rates that exists across racial and ethnic groups 
in the United States (Hernandez 1993, Lichter & Landale 1995). In 1984, the 
official poverty rates of African American and Latino children were 43.8% 
and 41.5%, respectively, compared with a rate of 16.9% among white children 
(Table 1). But these differences in income poverty undoubtedly mask even 
greater differences in deep poverty, consumption and wealth, and long-term or 
chronic poverty. 

For example, the child poverty rate in 1994 was 2.6 times greater among 
blacks than whites (Table 1), but 3.4 times greater using a measure of deep 
poverty (i.e. one half the poverty threshold) (US Bureau of the Census 1996). 
Moreover, nearly half of US blacks had a household net worth of less than 
$5000, compared to 2296 among whites (US Bureau of the Census 1996). 
Black children also experience longer spells of poverty than whites (Duncan 
1992). Period rates also ignore the greater incidence of recurrent poverty among 
black children (Duncan & Rodgers 1991). For 1991-1992, 15.1% of blacks 
were poor each month, compared with 3.196 of non-Latino whites (US Bureau 
of the Census 1995). This means, simply, that a higher proportion of poor 
black children at a given point in time will have been poor for a longer period 
of time or will have experienced more episodes of poverty. Cross-sectional 
studies that include poverty as a predictor at a single point in time are implicitly 
understating levels of economic inequality. 

The conventional wisdom 1s that economic hardship is offset to some ex- 
tent by institutionalized patterns of informal economic support and exchange 
among some groups, such as African Americans. But the empirical evidence is 
mixed. Quantitative studies suggest that poverty differentials are likely to be ex- 
acerbated as a result of ethnic-group differences in intergenerational economic 
support (e.g. money, food, housing) among elderly parents, their children, and 
grandchildren (Hogan et al 1993, McGarry & Schoeni 1995). The wealthiest 
children—a disproportionate share of whom are non-Latino whites—received 
the most income support from kin and friends. Moreover, household strategies 
to adapt to economic stress may mask or diminish economic deprivation among 
minority children. For example, families may adapt to economic hardship by 
"doubling up" with other families (Angel & Tienda 1982). 

Children are an understudied segment among the immigrant population 
(Jensen & Chitose 1994). Over 1596 of Latino children are first generation 
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immigrants and another 44% are native-born children of foreign-born parents 
(Oropesa & Landale 1997). First and second generation Latino children ex- 
perience disproportionately high rates of measured poverty—41% and 30% , 
respectively. The new welfare bill includes provisions to exclude legal immi- 
grants (for up to 5 years), which means that future pretax/pretransfer compar- 
isons in official income poverty will further distort true disparities in well-being 
across immigrant and native children. 

At the same time, assessing relative racial and ethnic group differences— 
including new immigrants from Latin America and Asia—may be confounded 
by cultural differences in family formation processes. For example, child 
poverty among Puerto Rican children is exceptionally high in female-headed 
families relative to their counterparts in white families or in other racial and eth- 
nic minority groups (Lichter & Landale 1995). The poverty rate among Puerto 
Rican children living with a single mother and her cohabiting partner, however, 
would be cut in half if all household income were included in its calculation 
(Manning & Lichter 1996). 

To date, explanations for racial or ethnic differences (or other group differ- 
ences) typically have focused less on the methodological artifacts described 
above and more on variations in job skills, work patterns, and family life. The 
parents of African-American and other minority children, for example, are typ- 
ically less likely to be highly educated and to work at stable, high-paying jobs, 
and are more likely to be raising children alone (Hernandez 1993, Hogan & 
Lichter 1995). The earnings of college-educated, full-time employed men in- 
creased by 5.2% between 1979 and 1989 (Wetzel 1995), while men with a high 
school education experienced a real earnings decline of 6.5% . In the absence of 
educational equality between racial and ethnic groups, rising earnings returns 
to education in America will exacerbate interracial and interethnic differences 
in children's poverty and economic status. 

Racial and ethnic differences in work patterns among women in single-parent 
and married-couple patterns also contribute to disparities in the economic status 
of children (Hernandez 1993, Lichter & Eggebeen 1994), but the effects of ma- 
ternal employment vary widely across racial and ethnic groups. For black-white 
comparisons, employment differences account for virtually none of the black- 
white difference in child poverty among children in married couple households 
(Lichter & Eggebeen 1994), and only a small fraction of the racial difference 
among children living in single female households. On the other hand, over two 
thirds of the black-white difference in child poverty resided in the higher pro- 
portion of black children living in high-risk, female-headed families (Eggebeen 
& Lichter 1991). Racial inequality is further magnified by large differences 
in child support payments, a pattern reflecting racial differences in the income 
of noncustodial fathers and the higher nonmarital fertility rather than divorce 
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(i.e. where child support and compliance are lower among African Americans 
than whites). 

For other racial and ethnic groups, employment differences matter more. 
Lichter & Landale (1995) showed that a substantial percentage of the dispro- 
portionately high rates of child poverty among Mexican-Americans and Puerto 
Ricans was due to differences in work patterns. For example, the poverty gap 
between white and Puerto Rican children in single-parent families would be 
reduced by nearly 60% if Puerto Rican mothers had the same employment 
rates as Anglo mothers. Moreover, little of the high poverty rate among Puerto 
Rican children relative to other Latino groups (including Cuban-Americans 
and Mexican-Americans) can be attributed to the lack of local employment 
opportunities or to the concentration of Puerto Rican families in economically 
depressed racially or segregated cities (Landale & Lichter 1997). 

In the final analysis, changing demographic composition—racial, immigrant, 
family, economic—and increasing child poverty are mutually reinforcing. Con- 
tinuing sociceconomic and racial differentials in mortality and fertility, for ex- 
ample, contribute to a relatively higher proportion of minority children and 
lower proportion of minority elderly in the American population, which fur- 
ther exacerbates observed poverty differentials between the young and the old. 
Moreover, without significant reductions in racial inequality, recent and future 
compositional changes in America necessarily imply future increases in the 
proportions of children who are poor. Without the economic assimilation of 
new ethnic immigrants (or reduced immigration), America’s demographic fu- 
ture necessarily implies increases in the child poverty rate. Without reductions 
in the inequality between men’s and women’s earnings, which gives rise in part 
to large economic disparities between children across family types, the “break- 
down” in the family implies increases in poor children in America. Finally, 
without significant reductions in economic inequality across racial, ethnic, and 
gender groups, growing population heterogeneity implies increases in the pop- 
ulation of poor children. 


THE ETIOLOGY OF CHANGING CHILD POVERTY 
AND INEQUALITY 


Why has income poverty and economic inequality—the absolute and rela- 
tive dimensions of poverty—increased among the families of children in the 
United States over the last two decades? The main reasons have been largely 
identified—changes in family structure, changing patterns of employment and 
earnings, and changes in public assistance. But much less agreement exists 
about the relative importance of these factors and, by extension, their possible 
solutions. 
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Changing Family Structure and Family Size 

Most discussions of child poverty begin with the transformation of the American 
family. On the one hand, a growing share of American children live with a single 
mother, while poverty rates for this group have remained high at nearly 50% 
for at least 30 years (Bane & Ellwood 1989). The simplicity of the argument 
contributes to its widespread appeal. If a greater share of children live in “high- 
risk” single-parent families today than in the past, then a greater proportion 
of children will be poor in the absence of any changes in single women’s 
economic position. Indeed, Duncan (1992) showed that virtually all of the rise 
in poverty for families with children between 1970 and 1986 was due to the 
growth in single-parent families. Eggebeen & Lichter (1991) observed that 
roughly 50% of the post-1980 increase in poverty was attributable to changes 
in family composition. In a time series regression analysis, Rodgers (1996) 
explained 70% of the annual variation between 1966 and 1993 in the child 
poverty rate by the proportion of families with children headed by unmarried 
mothers. 

To critics, such analyses appear to “blame the victim” or “blame women" for 
the rise in child poverty (Baca Zinn 1989, Hernandez 1993, Luker 1996). Such 
analyses also imply that children now living in female-headed families would 
have the same rates of poverty as children currently living in married couple 
families 1f only their mothers stayed married or became married. Increasing 
child poverty may not result from rising female headship, but rather rising 
economic hardship may be responsible for the breakdown in the family. In fact, 
Bane (1986) showed that 2596 of white single mothers and 7596 of black single 
mothers were poor before they became single mothers. 

Few studies, however, have actually tracked children's economic trajectories 
over time (Duncan & Rodgers 1989, Duncan 1992), and fewer still have related 
changes in family income shifts to changes in children's family circumstances. 
Bianchi & McArthur (1991) examined, with data from the Survey of Income 
and Program Participation, children's economic circumstances before and af- 
ter their parents divorced. Children in married couples that subsequently split 
up had an income-to-poverty ratio of 2.35 (before separation) compared with 
an income-to-poverty ratio of 2.87 among children with two parents always 
present. At the same time, the marital breakup resulted in a 12.696 decline 
in the income-to-poverty ratio among children, while the income-to-poverty 
ratio among children in intact families increased by 6.696 (between the first 
and eighth interviews). Duncan (1992) similarly showed that poverty among 
children of divorce increased from 12% to 27% during the period between 
one year prior to and one year after the divorce. After five years, the poverty 
rate dropped to 20%, but much of the improvement for children was due to 
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remarriage. Bianchi & McArthur (1991) showed that the income-to-poverty 
ratio increased from 1.39 to 2.63—a 90% increase—among children whose 
mother remarried. 

The Bianchi-McArthur study implies that marital disruption and low family 
income are mutually reinforcing. But the child poverty problem today also 
resides in the rise of nonmarital fertility among young mothers, a trend with 
no necessary economic cause. The mse in the ratio of nonmarital-to-marital 
fertility is inextricably tied to the rise in child poverty, in part because the 
poverty rates among children of unmarried mothers are substantially higher than 
among children of either married or divorced parents (Bianchi 1993). Compared 
with divorced mothers, unmarried mothers typically have less education and 
job-related skills, they are younger, they have younger children (which makes 
maternal employment difficult and childcare expensive), and they are much less 
likely to receive income assistance from the noncustodial father. 

Finally, a paradox of current family processes is that the rise in child poverty 
rates has more to do with declining and low fertility among the nonpoor pop- 
ulation than it does in the fertility behavior of the poor. Although family sizes 
have declined across all income groups, nearly 25% of the rise in child poverty 
between 1978 and 1987 resulted from increasing differentials in the mean num- 
ber of children living in poor and nonpoor families (Eggebeen & Lichter 1991). 
At the same time, the decline in fertility in the United States overall, especially 
during the 1960s, was an important demographic factor responsible for rapid 
declines in child poverty during this period (Gottschalk & Danziger 1994). If 
fertility rates had remained unchanged, child poverty rates would have risen 
7 and 3 percentage points for blacks and whites, respectively. Accordingly, a 
rise in fertility among the nonpoor would reduce measured poverty rates, but it 
would do little to help the 16 million children who are currently poor. 


Changing Parental Work and Earnings 

This emphasis on family change is consistent with the empirical and historical 
record, although it should not divert attention from the fact that poverty and 
economic inequality among American children also are inextricably linked to 
the earning capacities of children’s parents. Poverty among children living 
with two full-time working parents is virtually nonexistent (Hernandez 1993, 
Hogan & Lichter 1995). To be sure, this does not mean that simply moving all 
nonworking parents into the labor force will eliminate child poverty (Lichter 
& Eggebeen 1994, Lichter & Gardner 1996—97). Parents currently in the labor 
force are different (1.e. education and skills) from those not now working. Nor 
should changing employment rates per se be interpreted as the main engine of 
change in long-term trends in children's economic circumstances. For children 
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living with two parents, the rise in maternal labor force participation rates, espe- 
cially among young married mothers, means that more—not fewer-—children 
today than in the past live with two parents contributing to family income (Her- 
nandez 1993). For children living in female-headed families, a larger share of 
single women today than in the past (e.g. 1960) are employed. Indeed, 196 
more children of unmarried mothers would be poor if unmarried mothers had 
the same employment rates as observed in 1960 (Eggebeen & Lichter 1991). 

An obvious substantive implication is that the child poverty is less a problem 
of work than of finding work that pays a nonpoverty wage. Poverty rates 
among workers (using various definitions of worker) increased during the 1980s 
(Blank 1996), while an increasing share of the poor were working (Lichter et al 
1994). 'This reflects macroeconomic shifts, especially the transformation of 
the economy from high-wage manufacturing jobs to low-wage service jobs, 
declining unionization, and global competition for cheap labor from abroad. 
Moreover, the minimum wage has also declined substantially in real terms 
since the early 1970s relative to the average wage of all workers. At the 1996 
minimum wage of $4.25 hour, a full-time year-round worker would earn $8,500 
annually. The income poverty threshold for a two-parent family with one child 
was $11,929 in 1994. Even when the new minimum wage of $5.15 goes into 
effect in 1997, the annual earnings of $10,300 is still $1,629 below the current 
poverty threshold. 

More education is understandably viewed as the best solution to this problem. 
But it is worth noting that increases in child poverty occurred during the 1980s 
at the same time that educational levels generally increased and school dropout 
rates declined. In fact, educational levels actually increased during the 1980s 
more rapidly among the poor than the nonpoor (Shapiro 1989). This reflects 
the changing demographic composition of the poor; from an historical perspec- 
tive, elderly persons, a group with lower levels of education, have experienced 
declines in poverty (for nonwork-related reasons) while young adults—those 
bearing and raising children— have experienced increases in poverty despite 
high levels of education. The implication is straightforward: The post-1980 
rise in child poverty cannot be attributed to significant declines in human capital, 
job skills, or the work efforts of children's parents. 

The changing demography of the poor population means that policies that 
stress macroeconomic growth may have only limited benefits for many poor 
people. Indeed, a "rising tide lifts all boats" is an inappropriate aphorism today, 
especially for children; local economic conditions are only modestly associated 
with children's poverty status (Friedman & Lichter 1997, Landale & Lichter 
1997). Other factors also play a role. For example, the tendency for highly 
educated persons to marry each other has increased over time (Mare 1991), even 
if we adjust for the increasing demographic supply of highly educated women. 
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In addition, the rise in maternal employment was greater among women married 
to high-earning males (e.g. the top earnings decile) than among women married 
to low earners during the past decade or so (e.g. the bottom decile). Thus, 
educational marital homogamy and maternal employment have the potential to 
increase both family income inequality and the potential disparities between 
married couples with two workers and one worker (although the effects are 
small; see Cancian et al 1994), At the same time, growing income inequality 
was evident among all types of families with children, but especially among 
those headed by single mothers (Karoly 1994), 

The rise in child poverty associated with increased childbearing among un- 
married women also presumably has a work-related explanation. A common 
argument is that poor unmarried women lack access to “marriageable” or “eco- 
nomically attractive” men; thus, marriage is not an economically viable option 
for many poor women (South & Lloyd 1992, McLaughlin & Lichter 1996). In 
the absence of sufficient marital opportunities, the human need to love and be 
loved may be met through nonmarital childbearing and in single-parent fami- 
lies. Indeed, we have only a crude understanding of the social and economic 
conditions, cultural factors (including values and norms), and biological fac- 
tors (i.e. changes in hormonal levels) that are responsible for increased sexual 
activity, pregnancy and abortion, and out-of-wedlock childbearing among un- 
married women (see excellent discussion by Luker 1996). The recent emphasis 
on shortages of economically attractive males to marry necessarily implies that 
nonmarital fertility is “planned,” i.e. women's fertility decisions are based on 
a rational calculus that takes into account marital opportunities, the economic 
attractiveness of potential marital partners, and other alternatives (including 
welfare). But most nonmarital births are in fact unintended (Ventura et al 1995), 
a fact that militates against strictly economic interpretations or arguments that 
attribute teen childbearing and rising poverty to shortages of “good men" to 


marry. 
Changing Public Assistance and Government Policy 


Another common explanation for the rise in child poverty resides with the role 
of government policies. The new emphasis on behavioral poverty, i.e. poverty 
presumably caused by the bad decisions and bad values of parents, has been 
linked to explanations that stress the deleterious role of government, especially 
cash assistance programs like AFDC, in undermining the traditional family. 
Alternatively, others argue that government is not doing enough; indeed, the 
federal government has cut average cash assistance payments over the past 
decade and the new welfare bill will shift much of the responsibility for chil- 
dren’s well-being to states. Is it too much government or too little government 
that has contributed to the relative decline in children's economic status? 
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For some analysts (Murray 1984), the decline in family values follows from 
the generosity of welfare programs that encourage nonmarital fertility and di- 
vorce, which indirectly harms children economically and emotionally. Most 
evidence indicates that such policies have played a modest role at best in family 
formation processes (Moffitt 1992). Garfinkel & McLanahan (1986) suggested 
that about 15% of the rise in female-headed families for 1960-1975 was due to 
rising welfare benefit levels. Another common argument is that welfare creates 
disincentives to work, especially among unmarried mothers, which reinforces 
the impoverished position of children living in female-headed families. Yet, 
Bane & Ellwood (1989) reported that the family income of children living with a 
single mother who worked full-time was $6817 (in 1986 dollars) compared with 
$6284 among children living with a nonworking single mother. Welfare is an op- 
tion nearly as attractive as very low-wage work. The question remains whether 
it is high welfare or low earnings that is the more important disincentive to work. 

The preceding discussion suggests that governmental policies, by under- 
mining the traditional family, may have had an indirect negative effect on the 
well-being of American children. But the effects also may be more direct. For 
example, the average AFDC and food stamp benefit level for a mother with two 
children and no earnings declined from $10,169 to $7,471 between 1972 and 
1991 in constant dollars (Haveman & Wolfe 1993). The ameliorative effects of 
welfare are, not surprisingly, both low and declining. In 1979, 4296 of children 
were lifted out of poverty as a result of means- and nonmeans-tested cash and 
noncash transfers. This ameliorative effect declined to 32% by 1991. This 
compares unfavorably to the high and increasing effects observed among the 
elderly, from 77% to 81% over 1979—1991 (Ozawa 1995). Declining welfare 
benefit levels have contributed directly to the deteriorating economic well-being 
of poor children. 

For working parents and their children not now dependent on welfare, re- 
ducing child poverty may require government intervention to help parents earn 
more (Plotnick 1989, Bianchi 1993). Raising the minimum wage potentially 
is one option that supports working families while presumably increasing the 
incentives to work. Another option is to expand the Earned Income Tax Credit 
for families with workers, especially those with previously weak attachments 
to the labor force (such as young adults). The availability of high-quality and 
subsidized childcare also will keep money in the hands of working parents, thus 
potentially benefiting children. Although it is unclear whether such efforts will 
slow the rise in income inequality (Danziger & Gottschalk 1994), they would 
most likely help those children at the bottom of the income distribution if work 
is available. 

Enacting such legislation, however, will require a greater appreciation that 
children are a public good and that investments in children today are necessary 
for tomorrow’s future (Fuchs & Reklis 1992, Bianchi 1993). To the dismay 
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of some observers, recently signed welfare reform legislation will eliminate 
the largest cash assistance program that benefits poor children—-AFDC. Public 
assistance programs will instead be administered through the mechanism of 
block grants to states for the purpose of tailoring programs to reflect local needs, 
resources, and values. Time limits on continuous participation (i.e. 2 years) and 
lifetime participation (i.e. 5 years) will be imposed, and new work requirements 
for recipients will be stipulated. One study estimates that an additional 1 million 
children will become poor as a result of the welfare legislation ("Personal 
Responsibility and Work Opportunity Reconciliation Act") passed by Congress 
and signed into law by President Clinton in August 1996 (The Urban Institute 
1996). 

The economic effects of new welfare policies on children are difficult to fore- 
cast; indeed, dire predictions typically are wrong because they stimulate new 
policies that avert the forecasted outcome. Any effects on children's economic 
well-being will ultimately depend on how states respond to their new charge and 
how responsive state and federal governments are to new exigencies or problems 
with the proposed system. Consequently, this is an especially propitious time 
for research on poor children. New workfare and time-limiting welfare pro- 
grams, administered at the state level, will provide a natural laboratory for study 
of children. Presently. however, accurate predictions of the short- or long-term 
consequences for America's children simply lack a solid basis in data. 

In sum, the literature on the role of government regarding child poverty is 
highly polarized and politicized. The emphasis on different causes implies dif- 
ferent political agendas and different policy solutions. Behavioral explanations 
that emphasize the personal choices regarding work and family life are typi- 
cally set against structural explanations that emphasize the role of labor market 
conditions and government. But the emphasis on family change is not incom- 
patible with structural explanations focused on a declining economy for young 
and low-skilled parents. Monocausal explanations simply are incomplete and 
inadequate. It is equally important to keep in mind that the "causes" of poverty 
differ for different groups of children—whites and blacks, children in single- 
parent families and those living with two parents, and children of working and 
nonworking parents. Poverty rates will not be lowered among children without 
a multifaceted approach that builds strong families, promotes work and higher 
wages, and supports family and prowork public policies, including welfare 
policies. 


POVERTY TRANSITIONS AND INCOME STREAMS 


The level of family income that defines children as poor or not poor is ulti- 
mately arbitrary. Ás a concept and measure, the prevalence of poverty per se is 
arguably less important than how the income and material resources available to 
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children change over childhood, regardless of whether family income changes 
lead to poverty (as officially defined) or not. Divorce, for example, has poten- 
tially serious economic implications for children that do not always involve an 
episode of poverty but could impact children’s home economic environment 
dramatically in the form of lower consumption levels (without the benefit of a 
governmental safety net). Economic deprivation is relative; in this case, it is 
defined in terms of the family’s past levels of income and consumption. 

The dynamic nature of the family income stream also raises important ques- 
tions about the transitory or chronic nature of economic deprivation (however 
defined) and income inequality among children (Walker 1994). Income streams 
are linked in obvious ways to the entry into and exit out of welfare (Rank 1994, 
Harris 1996). Moreover, evidence of rising poverty among children and increas- 
ing polarization in the income distribution, although important, must not cause 
us to lose sight of the dynamic nature of shifts both upward and downward 
in the income distribution. We have only a very rudimentary understanding 
of the extent to which children are mired in poverty or in the lowest income 
quintile, and the extent to which poverty thresholds and income quintiles are 
permeable. A recent census study (Eller 1996) found that only 17.7% of poor 
children in 1992 exited poverty in 1993, and that a disproportionate share of 
children became newly poor in 1993 (4.096 vs 3.096 for all people). It remains 
an empirical question whether family transitions (including marriage and di- 
vorce), parental job mobility and employment dislocations, or governmental 
interventions matter most in transitions into and out of poverty. 

The dynamic nature of income and other economic resources makes accurate 
measurement problematic, especially in light of the transitory nature of the new 
family living arrangements of children. For example, should cohabiting couples 
(including many divorced women with young children) be treated conceptu- 
ally and analytically as quasi-married couple families, pooling the incomes 
of unmarried partners for the purpose of calculating child poverty rates? The 
conceptual problems are daunting (Eggebeen et al 1996, Manning & Lichter 
1996). Cohabiting unions are highly unstable, and there is little evidence re- 
garding the nature of intra vivos transfers of income or wealth (1.e. the extent 
to which a cohabiting partner contributes income for the benefit of children in 
the household). These are not small or esoteric issues. Nearly one half of ail 
individuals will cohabit before marriage, nearly 30% of all nonmarital births 
occur to cohabiting couples, and rates of cohabitation among divorced women 
with children are very high (Bumpass & Sweet 1989, Manning 1993). 

Finally, the transitory nature of family forms, the instability of most low-wage 
jobs, and the revolving door of welfare also contribute to a dynamic mix of in- 
come sources available to children. Is it absolutely low income (i.e. official 
poverty) or relatively low income that promotes deleterious socioemotional 
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outcomes for children? Or, alternatively, is it the changing mix of income 
sources that affects children's short- and long-term well-being? The effects of 
poverty may be quite different depending on whether the family income avail- 
able to children is from maternal work (i.e. where some income may be diverted 
to childcare) or from welfare (i.e. which may stigmatize persons, promote de- 
pendency, or reduce parental self-esteem). One recent study suggested that the 
relative mix of income from stepparents and biological parents has little ef- 
fect on various children's outcomes, such as academic achievement and school 
problems (Hanson 1995). The income and involvement from the noncustodial 
father may likewise be a source of economic support that could have positive 
as well as negative effects on the lives of children (Blankenhorn 1994, King 
1994, Seltzer 1994). The economic contribution, however sporadic, of noncus- 
todial fathers may create obligations in female-headed families (i.e. visitation) 
that foster emotionally unhealthy, conflictual, or unwanted contacts between 
estranged parents or between noncustodial fathers and children. This is one 
argument for child support assurance programs in which the government acts 
as the intermediary between estranged partners with children. 


CONCLUSIONS 


Since the “War on Poverty" in the 1960s, literally thousands of studies have been 
published on the topic of poverty (Sawhill 1988), including innumerable studies 
of poor children. Yet, there are roughly as many poor children today as before, 
the rate of child poverty now is at a 30-year high, and the income gap between 
rich and poor children is greater than at any time in recent memory (Lichter & 
Eggebeen 1993, Fischer et al 1996). Whether this apparent lack of progress 
reflects the inadequacies of the research, limitations in the efficacy of social 
policy, or the changing nature of poverty's causes is open to debate. It is clear, 
however, that previous research has not been translated into effective policy that 
benefits poor children, if measured by official trends in poverty and inequality. 

No review can adequately convey the deep theoretical and empirical contro- 
versies and ideological differences that have marked previous research on the 
extent and etiology of poverty in America. Nor can it convey the tension that 
exists between dispassionate policy analysis and the passionate rhetoric often 
needed to initiative legislative action. This makes it difficult to separate the 
hyperbole about children's lives from the facts. At the same time, accurately 
measuring children's economic well-being and inequality, and accounting for 
trends so as to inform public policy, is more important than ever in light of cur- 
rent welfare reform and the new emphasis on behavioral poverty born of spatial 
and social isolation (Wilson 1987, Massey & Denton 1993). Indeed, increas- 
ing cultural differentiation—now and in the future—ultimately may depend 
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more on relative differences in economic resources (i.e. inequality between and 
within groups) than on absolute poverty or low income per se. 

As a field, we need to develop more sensitive measures of well-being that 
better reflect the relative and episodic nature of economic deprivation during 
childhood. We need to revisit previously ignored or discredited cultural argu- 
ments of welfare dependence and poverty. We need to identify the adaptive 
strategies of families (e.g. doubling-up or marriage) that surely will emerge as 
a result of welfare reform and that will either reinforce existing inequalities or, 
alternatively, cause families and their children to become economically self- 
sufficient (Lichter & Gardner 1996—97). We need to study the possible recip- 
rocal relationships between poverty (especially during childhood) and biology 
(including physiology, cognitive traits, and temperament) that may perpetuate 
inequality from generation to generation. We need to better understand inter- 
group antagonisms and conflict, especially among racial and ethnic groups, and 
the growing class cleavages that make effective policy responses to poverty and 
welfare problematic. In short, conventional measures, traditional sociological 
and economic theories of poverty, and rigidly conservative or liberal ideologies 
can no longer be accepted uncritically or dismissed ad hominen if the future of 
America's poor children is to be anticipated with hope and promise. 
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ABSTRACT 


Infant mortality has long been viewed as a synoptic indicator of the health and 
social condition of a population. In this article we examine critically the struc- 
ture of this reflective capacity with a particular emphasis on how new health care 
technologies may have altered traditional pathways of social influence. The infant 
mortality rate is a composite of a series of component rates, each with its own 
relationship to social factors Advances 1n health care have reduced dramatically 
the risk of mortality for the critically 111 newborn, thereby elevating the impor- 
tance of access to this care in shaping absolute and disparate infant mortality rates. 
These advances in health services technology have also had the effect of concen- 
trating infant mortality among extremely premature and low birth-weight infants, 
a group tied directly to social factors operating through maternal influences and 


the general well-being of women. 


In this manner, current patterns of infant mortality in the United States provide 
a useful illustration of the dynamic interaction of underlying social forces and 
technological innovation in determining trends in health outcomes. We review 
the implications of this perspective for sociological research into disparate infant 
mortality, including the social and economic structure of societies, access to health 
services, the potential for prenatal intervention, women's health status, and racial 


and ethnic disparities 
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DISPARITIES IN INFANT MORTALITY RATES: 
NO LONGER A SIMPLE SOCIAL MIRROR 


High rates of death and disease among individuals of lower social and economic 
status have been extensively documented in industrialized countries throughout 
the past century. Indeed, in spite of substantial increases in economic indicators 
such as gross national product (GNP) and increases in dollars spent on medical 
care in the United States, socioeconomic disparities in mortality and morbid- 
ity persist to the present day (Marmot et al 1987, Hahn et al 1995, Pappas 
et al 1993). This documentation has been hailed as “one of sociology's most 
enduring contributions to the health field" (Williams & Collins 1995). 

Infant mortality rates in particular have long been considered a sensitive in- 
dicator of the impact of socioeconomic disparities on the health of populations, 
due in large part to the special vulnerability of the newborn to poverty and 
substandard living conditions (Marx 1967). In this manner, infant mortality 
rates have typically been seen to represent a kind of "social mirror," reflect- 
ing broad inequalities in society and, because of the inherent innocence of 
young children, illuminating the machinery of social injustice (Wise & Pursley 
1992). This reflective capacity has been circumstantially supported by the per- 
sistence of disparities in infant mortality rates throughout this century, mirroring 
the persistence of larger social and economic inequalities in the United States 
(Children's Bureau 1921, Gortmaker 1979a, Wise et al 1985). 

A premise of the present review, however, is that mechanisms that tradition- 
ally defined infant mortality as a social mirror have been altered appreciably 
by rapid innovations in health services technology. Such technological change 
also creates new opportunities for socioeconomic differentiation as life-saving 
therapies or preventive interventions potentially are made available only to 
the economically advantaged. This, in fact, currently happens on an interna- 
tional scale, where many seriously ill newborns in developing countries die 
of conditions currently treatable by health services commonly available in the 
industrialized world. 

The sociological literature has generally focused on the relationship between 
social and economic variables and the risk of infant mortality, but it has largely 
ignored the role of changing health care technology. In this review our aim is to 
illustrate the fact that there is now abundant evidence for the substantial impact 
of health services technology in reducing infant mortality rates, as well as for the 
powerful influence of social and economic inequalities independent of health 
services technology. We review the evidence for declining infant mortality 
rates in the United States and other industrialized countries, and we document 
the continuing disparities in rates by social and economic characteristics of the 
population. These different trends suggest that the interaction of social and 
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technologic determinants has operated to decrease absolute rates while simul- 
taneously maintaining broad disparities in these rates (Wise & Pursley 1992). 
In this empirical context, the perspective of Ogburn (Ogburn 1969) suggests 
that innovation in technology has resulted in a broad array of sociological con- 
sequences, only some of which have been examined. 

We have accordingly structured this review to critically examine three issues: 


1. The evolution of infant mortality patterns in the United States 


2. The extent to which the long-documented mechanisms of socioeconomic 
influence on infant mortality have been altered by a technical revolution in 
modern health services for pregnant women and newborns 


3. The impact of these altered causal pathways on the conceptual legitimacy of 
using infant mortality rates as a marker of social and economic inequalities 


While our focus is on infant mortality, the empirical and theoretical work 
we cite can be directly applied to other arenas of disparate health outcomes as 
well. We focus on the United States experience, although similar issues arise 
in the other industrialized countries of the world. 


THE EVOLUTION OF INFANT MORTALITY 
PATTERNS IN THE UNITED STATES 


One of the most enduring indictments of social and health policies in the United 
States is the consistently poor showing of the United States in international rank- 
ings of infant mortality. Although the United States is the wealthiest country 
in the world and expends more per capita on health and medical care than 
any other country, our infant mortality rates consistently rank below those of 
many other nations—twenty-second in 1990, below Italy, Spain, and Singapore 
(UNICEF 1992) (see Figure 1). While there are some reporting differences be- 
tween the United States and other countries that contribute to these disparities 
(e.g. in some countries, an infant death during the first few hours after birth is 
not counted as a live birth), these can account for only a part of the differences 
observed (Hartford 1992). The clear message derived from these data is that 
the United States ranks lower than all the other industrialized countries in infant 
survival. 

In addition to a relatively high overall rate of infant mortality, there are 
pronounced disparities within the United States among social, economic, racial, 
and ethnic groups and geographic areas. Some communities in the United States 
experience infant mortality rates resembling those of developing countries. The 
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Figure 2 Black and white infant mortality rates in the United States 1950-1993, 


reasons for these disparities are complex but are generally thought to reflect the 
powerful effects of social and economic inequality. 

One important indicator of continuing disparity is provided by trends in 
the infant mortality rates of black (African-American) and white infants in the 
United States during the period 1950-1993 (see Figure 2). These data document 
the persistently higher rate of infant mortality among African-American infants 
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Figure 3 Infant mortality rates by household yearly income in the United States 1964—1966 and 
1988. 


in the United States and describe a black/white disparity that has not been 
narrowing during the past three decades. 

A striking recent confirmation of the persistence of economic disparities can 
be seen from tabulations of data from two national infant mortality studies 
in the United States, the 1964—1966 National Natality and Infant Mortality 
Followback Surveys (NNS and NIMS) (National Center for Health Statistics 
1972) and the 1988 National Maternal and Infant Health Survey (Sanderson et al 
1991) (Figure 3). These data provide the only national estimates of disparities 
in infant mortality by household income. The income categories are only 
approximately similar, because categories of income were used in the survey, 
adjusted for changes in the Consumer Price Index (United States Bureau of the 
Census 1989). 

The 1964—1965 data precedes the implementation of Medicaid and thus pro- 
vides important "baseline" information concerning economic disparities. The 
1988 data comes after more than a decade of operation of this major health care 
financing program that includes poor families with children. Harly evaluations 
of the implementation of Medicaid indicate that it helped to reduce dispari- 
ties in the utilization of health services between poor and nonpoor families 
(Butler & Scotch 1978), while more recent studies have documented problems 
in access and declining physician participation in Medicaid (Yudkowsky et al 
1990). The data displayed in Figure 3 indicate no attenuation in the relative risk 
of infant mortality experienced by women with lower, compared with higher, 
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levels of household income during the past quarter of a century. These findings 
mirror those observed in Britain where, decades after the implementation of 
the National Health Service, social class disparities in infant mortality persist 
(Black et al 1982, Gray 1982). 

The apparent lack of progress in improving the international ranking or re- 
ducing social disparities in infant mortality in the United States should not, 
however, be perceived as suggesting a lack of change in the overall rate or its 
underlying dynamics. Quite to the contrary, the past two decades have wit- 
nessed the most profound alterations ever recorded in the structure of infant 
mortality patterns in the United States. This dynamic character challenges the 
traditional social interpretation of the infant mortality rate and lies therefore at 
the heart of this discussion. 

Although disparities persist, absolute rates of infant mortality for all groups 
in the United States have fallen dramatically over the past three decades. The 
infant mortality rate in 1990 was approximately 32% of the rate recorded in 
1960. Similar dramatic reductions were seen for both black and white newborns, 
with the 1990 rates among whites being about 30% of the rate in 1960 and the 
1990 rate among blacks being 37% of the earlier rate. Therefore, the persistence 
of social disparities in infant mortality has occurred in a setting of significant 
reductions in absolute rates, a consequence of the rapid improvements in health 
services technology surrounding childbirth. 


THE IMPACT OF HEALTH SERVICES TECHNOLOGY 


In order to explore the determinants of these complex trends, it is useful first to 
recognize that the infant mortality rate is actually a composite of a number of 
component rates, each with its own set of relationships with social factors and 
health services technology. One broad categorization separates infant deaths 
into those occurring during the neonatal period (the first 28 days following birth) 
and those occurring during the postneonatal period (the rest of the first year of 
life). This distinction, based on age at death, serves as a traditional proxy for 
cause of death. Postneonatal deaths have historically been due to viral and 
bacterial processes such as gastroenteritis and pneumonia and to injuries, all 
of which have long been closely linked to socioeconomic factors like poverty- 
level income (Gortmaker 1979a). Neonatal mortality has been more deeply 
shaped by processes associated with birth, including congenital anomalies, 
prematurity and low birth weight, and obstetrical catastrophes associated with 
delivery. These causes, though generally associated with social variables, tend 
to operate more directly through maternal and pregnancy-related pathways. 
Although the divide between the causes of neonatal and postneonatal mortality 
has never been sharp, this stratification still holds some importance in that it 
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attests to the heterogeneity of infant death and depicts an initial fork in potential 
pathways of social influence on infant mortality (Wise 1990). 

In this context it is important to note that in 1990 approximately 63% of 
infant mortality in the United States occurred in the neonatal period, compared 
to 70% in 1950. This continuing concentration of death in the first 28 days has 
been due to a precipitous decline of both postneonatal and neonatal mortality 
during the 1960s, 1970s, and 1980s. While postneonatal mortality rates remain 
higher than those of many other industrialized countries, death in this age 
group from injuries and infectious processes has become relatively rare in the 
United States. The largest contributors to postneonatal mortality today include 
the consequences of prematurity and sudden infant death syndrome (SIDS), a 
diagnosis that probably captures a variety of pathologies but 1s broadly defined 
by the sudden, unexpected death of a previously healthy infant. Although SIDS 
is associated with an important social gradient, it is less profound than those 
related to more traditional postneonatal causes. 

While postneonatal mortality has been altered significantly, neonatal mor- 
tality has been most substantially transformed. Because neonatal mortality is 
so deeply tied to the health of the newborn at birth, it is useful to stratify the 
neonatal mortality rate into two components based on birth weight, the most 
generally meaningful indicator of risk at birth. Birth weight operates in this 
manner because it reflects the adequacy of the intrauterine environment and 
serves as a general proxy for gestational age, which is difficult to assess ac- 
curately in large populations. Stratifving the neonatal mortality rate by birth 
weight results in defining two mechanisms by which differences in neonatal 
mortality can occur: differences in the distribution of birth weights [a larger 
portion of births falling into the very low birth weight (less than 1500 g) groups 
would generate a higher risk of neonatal death in the population] or differences 
in birth weight-specific mortality (higher mortality within each birth-weight 
group would generate higher overall neonatal mortality). 

When neonatal mortality trends are examined in this manner, two major 
findings emerge. First, recent reductions in neonatal mortality in the United 
States have been due almost entirely to improvements in birth weight-specific 
survival; birth-weight distributions have remained relatively stable over the 
past 30 years. For example, national evidence indicates that during the period 
1960-1980, 15% of the decline in white infant mortality risk can be attributed 
to improving birth-weight distribution, and 85% to improved birth weight- 
specific mortality. Among blacks, —396 of the change in infant mortality rates 
during this period can be attributed to a worsening birth-weight distribution, 
and 103% to declining birth weight-specific mortality (Buehler et al 1987). 

Second, the primary determinant of the poor international standing of the 
United States and of social disparities in the infant mortality rate in the United 


154 GORTMAKER & WISE 


Mortality Rate per 1,000 





1000-1499 1500-1999 . 1000-1499 1500-1999 
Year and Birthweight Category 


Figure 4 Neonatal mortality rates by birth weight and race United States, 1960 and 1980. 


States is an elevated rate of extremely low birth-weight (1.e. less than 1000 g) 
infants; the survival rates of US newborns when birth weight 1s controlled 
are among the best in the world (Guyer et al 1982, Alberman et al 1992). 
Therefore, the substantial improvement in birth weight-specific survival in 
the United States has had the effect of concentrating mortality into the most 
premature and very low birth-weight categories, thereby making the United 
States infant mortality rate increasingly sensitive to factors associated with 
extreme prematurity. More than 5096 of all neonatal deaths in the United States 
are currently due to newborns born at less than 25 weeks of gestation (the legal 
abortion limit is 24 weeks), and infants born weighing less than 1500 g make 
up 1.296 of all births but 6496 of all neonatal deaths (Wise et al 1995). 

What then is responsible for these major improvements in the survival rates 
of ill newborns? A series of studies have identified a revolution in medical 
management of high-risk deliveries and critically ill newborns as accounting 
for this improved survival. These dramatic reductions are graphically displayed 
in Figure 4, for both black and white infants weighing 1000—1999 g at birth 
during the period 1960-1980. During these two decades, the rate of neonatal 
death among black infants weighing 1500—1999 g at birth decreased 6696, and 
among whites this decrease was 67% (Buehler et al 1987). 

The dramatic effectiveness of this technology, however, did not lead to an 
attenuation of social and economic disparities in infant mortality rates. The 
similar decline in rates among both black and white infants indicates similar 
access to this technology during this period. One study of a geographic region 
of the United States with virtual universal access to neonatal intensive care 
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technology (Boston in the mid 1980s) reveals substantial social and economic 
differences in infant mortality rates, even though virtually all of these births 
occurred in Level III centers, meaning that all of the women and their infants 
had available the most sophisticated medical technology at birth (Wise et al 
1988). These findings indicate that the causes of the decline in infant mortality 
rates—intensive care technology—are different from the effects of social and 
economic forces that still produce disparate outcomes. When effective medical 
technology is available to all in a geographic area, social and economic forces 
still emerge as visible factors. 

In addition, this explanation also suggests that increases in infant mortality 
rates over coming decades could, paradoxically, be possible in areas that achieve 
an excellent system of access to neonatal intensive care and relatively low 
infant mortality rates. If restrictions in access to this neonatal intensive care 
occur, mortality rates could increase. As managed care systems increase in 
prominence in the United States in coming years, potential changes in access 
to services for high-risk infants need to be carefully evaluated (McCormick & 
Richardson 1995). 

The remarkable success of neonatel intensive care has in many respects been 
overshadowed by questions of the appropriateness of intensive interventions 
for the most premature and seriously il] newborns. There are clear questions 
of cost although neonatal intensive care unit (NICU}-associated care is a mi- 
nuscule component of overall health care spending, and studies have indicated 
NICU technology to be cost-effective as well as cost-beneficial for all but the 
very smallest infants (Office of Technology Assessment 1987, Boyle et al 1983). 
Moreover, the improved survival of ill newborns has produced little evidence of 
an aggregate increase in the prevalence of chronic childhood disorders, prob- 
ably due to improvements in medical care (Shapiro et al 1983, Gortmaker & 
Sappenfield 1984, McCormick 1993). Although the exploration of the ethical 
and resource allocation issues involved in neonatal intensive care is, of course, 
important, and although the identification and elimination of iatrogenic health 
services must continue, such considerations should not obscure the documented 
effectiveness of neonatal intensive care in reducing mortality, nor the huge im- 
pact of these interventions 1n altering the nature of infant mortality in the United 
States. 

We suggest that the structure of infant mortality in the United States has been 
altered forever by major technical advances in the clinical management of the 
high-risk pregnancy and the critically ill newborn. This contention, in turn, 
demands a reconsideration of the approaches used traditionally to examine the 
social determinants of infant survival as well as the practical strategies long 
advanced as being the most effective means of preventing infant death. 
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SOCIAL INFLUENCE AND TECHNOLOGICAL 
INNOVATION 


Perhaps the most striking characteristic of the vast literature concerned with 
the causal pathways of infant mortality is its profound dichotomization. One 
view focuses on the direct impact of social and economic environments on the 
risk of infant mortality, while the second focuses on the impact of variations in 
health services (Gortmaker 1979a). One example of the first approach to the 
infant mortality problem in the United States is provided by Marsden Wagner, 
in remarks to the National Commission to Prevent Infant Mortality: 


Infant mortality is not a health problem Infant mortality 1s a social problem with health 
consequences. It is analogous to traffic accident mortality in children: the first priority 
for improving traffic accident mortality ın children 1s not to build more and better medical 
facilities, but rather to change traffic laws and better educate drivers and children In other 
words, the solution is not primarily medical, but environmental, social and educational. The 
same is true for infant mortality: the first priority is not more obstetricians or pediatricians 
or hospitals, nor even more prenatal clinics or well-baby clinics, but rather to provide 
more social, financial and educational support to families with pregnant women and infants 
(Wagner 1988). 


This perspective focuses on the powerful role of social and economic forces 
in the production of disparities in infant mortality rates, and for many sociolo- 
gists itis a signal illustration (in a world preoccupied with the drama of medical 
discovery) of the importance of social status in defining health. With deep his- 
torical roots, this elevation of social causation in defining an inherently tragic 
health outcome has held special currency as a stark counterargument to the 
claims of modern medicine. Steadily improving socioeconomic environments 
of families, including improvements in food supplies and in sanitation, have 
traditionally been credited with causing reductions in infant mortality and im- 
provements in life expectancy. Only a very marginal part of this improvement 
in mortality has been attributed to improvements in the delivery of effective 
medical care. As Dubos noted in 1959: 


The control of childhood diseases. .resulted more from better nutrition and sanitary prac- 
tices than from the introduction of new drugs It 1s remarkable, in contrast, that little 
practical progress has been made toward controlling the diseases that were not dealt with 
by the nineteenth-century reformers (Dubos 1959). 


This elevation of social pathways as fundamental is, of course, well-supported 
by an extensive research base throughout this century, and itis precisely why the 
infant mortality rate is so widely embraced as a legitimate indicator of underly- 
ing social conditions. However, as we have discussed, recent decreases in infant 
mortality rates are due solely to improved health care technology. Social and 
economic disparities in infant mortality rates persist, but very little empirical 
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data documents the mechanisms whereby inequalities are ultimately expressed 
in the biologic reality of infant death. Associations have been reported with 
infant mortality rates in multiple arenas where inequality is manifest, including 
the mother and household as discussed above, as well as the community and 
the larger society (LaVeist 1992, Kennedy et al 1996). Yet, one of the central 
tasks of any sociologic theory is to explain how inequality in these domains is 
ultimately transformed into infant death. This process, which might be termed 
“domain transformations” requires greater empirical inquiry. Do social and 
economic inequalities deprive people of the necessities of life such as nutrition, 
shelter, or social relations? Is social stress a major influence? Do these condi- 
tions increase the risk of infection or produce more risky health behaviors such 
as smoking (Gortmaker & Wise 1994)? Are multiple aspects of stratification 
involved, including characteristics of individual, household, and community? 
How do racism, sexism, and social class interact to impact infant health (Krieger 
et al 1993)? Is longer-term poverty a more fundamental cause than low income 
represented by the typical yearly income measure (Starfield et al 1991)? 

One concern is that a tight focus on broad associations of infant mortality 
with social variables such as poverty, low educational attainment, or race and 
ethnicity can disrespect the power of technical interventions to alter both the 
intensity and nature of social and economic influence on infant survival. While 
it is important to document associations of inequality and infant mortality, it 
is even more important to understand underlying causal mechanisms, so that 
effective policies and programs can be developed. If a clinical or public health 
intervention is developed that is highly effective in preventing infant death from 
causes associated with poverty, then much, if not all, of a domain’s established 
influence may fall away. Effective interventions, therefore, have the capability 
of uncoupling long recognized social influences such as poverty from specific 
outcomes such as infant death (Gortmaker & Wise 1994). Independent of such 
interventions, policies and programs can also focus on the often more difficult 
task of alleviating the underlying inequality as a social strategy. 

Technical innovation can both modulate pathways of social influence and also 
create social influence for conditions traditionally insensitive to social factors. 
To illustrate this point, Jencks described the case of phenylketonuria (PKU), a 
genetically based disorder that can be controlled by a properly regulated diet 
(Jencks 1980). While PKU is a disorder that 1s completely genetic in origin, 
because there is a highly effective health intervention (diet in this case), any 
disparity in outcome can now be viewed as social in origin, particularly related 
to access to the intervention. 

As noted above, new technology can also lead to increases in disparity. One 
newly emergent case that may fit this pattern is that of surfactant therapy for res- 
piratory distress syndrome, a major cause of infant mortality due to prematurity. 
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After this therapy became available, neonatal mortality rates improved more 
for white than for black infants with similar very low birth weights (Hamvas 
et al 1996). 

In this context, the impact of neonatal intensive care on recent trends in infant 
mortality in the United States provides an important arena for examining how 
technical innovation can alter traditional pathways of social influence. Such 
alterations are likely to be important for health conditions for which technical 
improvements in prevention or treatment are advancing rapidly. 

The influence of social and economic forces on the prevalence of the HIV-1 
(AIDS) virus is useful to explore from this perspective. The HIV-1 virus is 
not a major cause of infant mortality, but in some areas it has emerged as a 
substantial cause of early childhood death. A number of social factors have been 
associated with the risk of HIV-1 in infants and young children, in particular, 
maternal exposure to injection drug use (either her own or her sex partner's). 
Broader social forces are also at work: Housing destruction apparently led to 
the migration of a large IV drug-using population, with high rates of HIV-1 
infection, throughout New York City (Wallace et al 1995). 'The development 
of new combination therapies for HIV-1—infected women and evidence for the 
effectiveness of the drug zidovudine (AZT) in preventing perinatal transmission 
(Connor et al 1994) (from infected mother to newborn child) have generated 
a new, powerful arena of social influence in determining relative access to an 
effective but often expensive intervention that can save infant lives. 

Our intent in stressing the impact of technological innovation is not to dimin- 
ish the fundamental importance of social forces in shaping patterns of infant 
mortality nor the relevance of sociology to their essential exploration. Rather, 
it is to call attention to the inherent volatility of this social influence and the 
importance of the sociology of health care in explaining this dynamic character. 
In this way, our objective in invoking the efficacy of clinical and public health 
intervention is not to limit the realm of sociology, but to plead for recognition 
among infant mortality specialists of an opportunity to expand their disciplinary 
embrace to include the social forces that shape both the dimensions of medical 
progress and differential access to the fruits of this technical struggle. 


FUTURE DIRECTIONS IN SOCIOLOGICAL 
RESEARCH 


We point out the dynamic nature of the causes of infant mortality in order to 
highlight the importance of critical sociological research in illuminating how 
social and economic forces and health services technology will shape the risk 
of infant death in the United States in the years to come. 
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Societal Characteristics and the Generation 
of Disparities in Infant Mortality Rates 


A wide range of evidence indicates relationships between social and economic 
fortunes of societies and the risk of infant death. Some of the most dramatic 
historical evidence comes from studies of social structural change following 
World War II: Holland experienced very large increases in infant mortality as 
a consequence of starvation following the end of the war (Smith 1947, Susser 
& Stein 1994). Perhaps the most profound increase in infant mortality in an 
industrialized country to be observed during the past 30 years occurred in 
the USSR during the period 1971-1975—a 34% rise from 22.9 to 30.6 per 
1000 (Davis & Feshbach 1980, Eberstadt 1981). The authorities in the USSR 
stopped publishing infant mortality rates soon thereafter, but the documented 
increases are unlike those seen anywhere in the industrialized world. More 
recent reports indicate a worsening of infant mortality rates in many Eastern 
European countries in the 1980s (Nanda et al 1993). Note that these recent 
increases occurred in the absence of modern neonatal intensive care capability. 

Recent cross-sectional studies of countries and states indicate higher mor- 
tality rates in areas with greater income inequality (Waldman 1992, Kennedy 
et al 1996). Segregation indices are substantially correlated with rates of infant 
mortality among cities in the United States (Polednak 1991, LaVeist 1992). 
One study pointed out that countries with higher defense spending experience 
higher infant mortality rates (Woolhandler & Himmelstein 1985). 

Others have examined yearly fluctuations in the economy. Brenner analyzed 
time-series data for the United States and other industrialized countries during 
the first half of the twentieth century and found evidence for an association 
between variations in rates of unemployment at the national level and rates of 
neonatal mortality (Brenner 1973, Brenner 1983). More recent evidence in 
support of this relationship has been reported, based on data from Los Angeles 
County, although the authors note verv small effects (Catalano & Serxner 1992). 
One of the few individual level randomized studies based on the negative income 
tax experiment indicated that an income supplementation program can improve 
birth-weight outcomes (Kerher & Wohlin 1979). 

Substantial limitations to these analyses of generally aggregate data exist: 
One fundamental problem is the difficulty of making causal inferences from 
geographically aggregated data (Susser 1994, Geronimus et al 1996). More 
importantly, these analyses generally do not provide insight into the pathways 
whereby social and economic conditions or the lack of effective health ser- 
vices may be producing disparities in infant mortality. During these times of 
enormous change in both the social and economic reconstruction of societies, 
including the current focus on “welfare reform" in the United States, and the 
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reorganization of health services, it is crucial that different potential routes of 
influence are critically evaluated. 


Access to Health Services 


One of the central consequences of the increasing efficacy of health services 
for the newborn is an increasing burden on society to provide these services 
equitably. In this sense, technical progress elevates the importance of access in 
the determination of disparities in mortality outcome. 

Government-funded medical financing programs in the United States such 
as Medicaid have grown enormously in size and scope during recent decades. 
The Clinton administration attempted without success to considerably restruc- 
ture the present health care system in the United States (The President’s Health 
Security Plan 1993)—a system that can provide excellent quality of medical 
care, but that also leaves a substantial fraction of the population without fi- 
nancial coverage for health care (Blendon et al 1989). It is also a system that 
focuses most readily on treatment of illness, and less often and effectively on 
the prevention of illness. Research based on the National Health Insurance 
Experiment indicates that providing complete financial coverage for inpatient 
hospital care to children will not lead to mushrooming medical care costs. There 
is also no evidence to justify reimbursing acute ambulatory care at a higher rate 
than preventive services (Leibowitz et al 1985). However, the United States is 
still one of only two industrialized countries in the world (South Africa being 
the other) that do not provide financial coverage for health care for all pregnant 
women and children (Miller 1987). 

It is not clear whether socioeconomic disparities in infant mortality could 
be eliminated by universal access to health services for children and women 
in the United States. We have already noted that disparities remain in Great 
Britain even though the National Health Service provides accessible care to 
the whole population. Universal coverage should help to reduce the number of 
areas reporting excess rates of infant death throughout the United States, now 
generally the poorest communities. 

As neonatal intensive care technology has increased in effectiveness, access 
to this technology has become more crucial, and yet there are few studies of 
access to this care. Only a minority of hospitals can afford to staff neonatal 
intensive care units (NICUs), and this has lead to the regionalization of ser- 
vices (McCormick & Richardson 1995). One analysis of a metropolitan area 
in the 1970s found no social class disparities in neonatal mortality among very 
low birth-weight infants, indicating relatively uniform access to this emergency 
health service (Paneth et al 1982). A more recent analysis found a similar at- 
tenuation of the association of social factors and infant death among lower 
birth-weight infants (Eberstein et al 1990). Another study of four states in 
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the early 1980s indicates substantial variation in access to NICU care, with 
sick rural infants most at risk of being born in a hospital without these capa- 
bilities (Gortmaker et al 1985). The most crucial changes, however, may now 
be appearing as managed care plans grow throughout the United States, and 
as pressures to limit costs may limit access to these services (McCormick & 
Richardson 1995). One recent analysis from California indicates substantially 
lower mortality and no greater costs among high risk births in hospitals with the 
best neonatal intensive care technology (Phibbs et al 1996), providing evidence 
that any limitations 1n access to this care in California could result in increased 
infant deaths. Therapid changes occurring in the organization of health services 
in the United States provide important opportunities to examine the sociology 
of access to these services—and corresponding health consequences—in the 
decades ahead. 


Prenatal Risks and the Potential for Intervention 


Much of the public debate about infant mortality has focused on the potential for 
prevention via improved access to "prenatal care.” While there are no definitive 
randomized controlled trials of the effectiveness of traditional prenatal medical 
care (for obvious ethical reasons), the preponderance of evidence supports the 
effectiveness of prenatal care in reducing the risk of low birth weight and in in- 
fluencing the outcomes of pregnancy (Institute of Medicine 1985), particularly 
among high-risk populations (Gortmaker 1979b, Murray & Bernfield 1988). A 
wide variety of clinical circumstances make clear the benefits of prenatal medi- 
cal care (Chalmers et al 1989). Careful monitoring during pregnancy means that 
high-risk deliveries and premature labor can be identified quickly and referred 
to the appropriate tertiary center for care, and this can lead to reduced mortality. 

In spite of this evidence, the past few decades have demonstrated that im- 
provements in prenatal health care have not substantially affected infant mortal- 
ity rates. We have already noted that little of the decline in the infant mortality 
rate has been due to improved birth-weight distribution (the presumed outcome 
of improved of prenatal care) (Buehler 1987). It seems clear that expansion of 
access to the existing system of prenatal care will not solve the infant mortality 
problem, even though prenatal care can have greater impact as better outreach 
and delivery systems are devised (Institute of Medicine 1988). The develop- 
ment of more effective prenatal medical care, however, is for the most part 
contingent on a much better scientific understanding of the causes of preterm 
delivery, a phenomenon of largely unknown etiology (Paneth 1995). 

In spite of these limitations, a variety of health-related behaviors of women 
increase the risks of infant mortality, including cigarette smoking, alcoholism, 
drug use including cocaine, poor weight gain during pregnancy, sexually trans- 
mitted diseases, and 1nadequate nutrition. These behaviors in turn are influenced 
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by social and economic context, and hence intervention strategies focused on 
these risks can take many forms, ranging from clinical interventions (e.g. smok- 
ing cessation during the prenatal period) to state policy (e.g. cigarette taxation). 
Many of the interventions focused on these risks reside at least in part outside 
of the medical care system. 

The evidence for the effects of cigarette smoking as a determinant of low 
birth weight (less than 2500 g) (Kramer 1987, Kleinman & Madans 1985) and 
on infant mortality (Kleinman et al 1988) is particularly striking and strong. 
Attributable risk estimates indicate that elimination of smoking among white 
married women could reduce rates of low birth weight by 19% (Kleinman et al 
1985). Low-cost smoking cessation programs for pregnant women are effective 
(Windsor et al 1993) but not widely implemented. One barrier to this imple- 
mentation is the fact that insurance carriers and Medicaid in general do not pay 
for these programs. An area of state policy with potentially substantial impact is 
increased excise taxes on tobacco, which can reduce cigarette consumption and 
discourage initiation of smoking among youth (Lewitt & Coate 1982, Manning 
et al 1991). Furthermore, bans on advertising and sales (Warner et al 1992), re- 
striction of smoking in public places, and restricted sales of cigarettes to minors 
all significantly affect smoking rates in the United States. 

Recent concern has focused upon the effects of cocaine and crack—a smoke- 
able form of cocaine~—in increasing the risk of premature labor. However, the 
estimated effects of cocaine and marijuana use upon fetal growth appear to be 
less than those estimated for the effects of cigarettes, even when valid mea- 
sures of marijuana and cocaine use are applied (Zuckerman et al 1989). Given 
the lower prevalence of cocaine and marijuana use in the population, the at- 
tributable risks of poor birth outcomes associated with cigarettes appear much 
greater than those associated with cocaine. 

Also operating outside the medical care system but producing higher birth 
weights is the Special Supplemental Nutrition Program for Women, Infants 
and Children (WIC). A series of studies from the United States indicate that 
participation in WIC produces higher birth weights (Kotelchuck et al 1984, Rush 
1986). There is also evidence that FDA regulations concerning the nutritional 
content of food can improve birth outcomes: Supplementation of diet with folic 
acid, for example, can reduce the population risk of neural tube defects, which 
are major birth defects (Milunsky et al 1989). 

Other prenatal risks and possible effective interventions have been identified. 
For example, studies of women’s work have identified physically demanding 
work as contributing to prematurity and low birth weight (Naeye & Peters 1982, 
Homer et al 1990) as does environmental pollution (Mushak et al 1989) and 
toxic substances in worksites (Hemminiki et al 1983). Stress also increases 
risk of low birth-weight and preterm birth (McClean et al 1993), and social and 
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psychological support produce positive effects during pregnancy (Elbourne et al 
1989). One intriguing intervention study finds evidence for the effectiveness 
of nurse home visiting during pregnancy in improving birth outcomes among 
disadvantaged women (Olds et al 1986). 

However, this evidence concerning risk and intervention does not necessar- 
ily help explain some of the more persistent disparities in infant mortality. For 
example, smoking behavior is not a likely explanation for observed racial dis- 
parities in birth-weight distribution because black women smoke cigarettes at 
rates roughly comparable to those of white women (Prager et al 1984). Studies 
controlling for smoking, alcohol use, and other social and economic charac- 
teristics fail to eliminate black/white differences in rates of low birth weight 
(Shiono et al 1986, Kramer 1995). This case illustrates the reality that an anal- 
ysis of risks for infant mortality may not necessarily provide the best focus for 
an analysis of causes of disparity in infant mortality among population groups. 
This is the case with another often cited “cause” of high infant-mortality rates 
among black infants: high rates of teenage childbearing. Data clearly indicate 
that teenage pregnancy is associated with only a small minority of infant deaths, 
and so such data do little to help explain racial disparities in rates (Wise 1993). 

These prenatal risks and interventions, therefore, remain an important arena 
of social and technical influence on infant outcome. However, prenatal effects, 
viewed in isolation, do not explain much of the differentiation in infant mortality 
rates over time or between different social groups. Rather, prenatal influences 
are perhaps best understood when viewed in a broader context that includes 
social and clinical mechanisms that operate over the continuum of a woman's 
reproductive life. 


A Focus on Women's Health Status 


The impact of neonatal intensive care in concentrating infant mortality into the 
most premature group of newborns holds important implications for continued 
progress in reducing overall levels of, as well as disparate, infant mortality rates. 
Perhaps the most crucial need is to reconsider the current preoccupation with 
tightly focused efforts to intervene during the prenatal period. While our review 
has indicated evidence for effective strategies during the prenatal period, we 
have also noted that a large proportion of infant deaths occur to infants born 
before 25 weeks gestation. Thisisa very narrow window in which to concentrate 
prevention efforts. A more rational strategy would focus on linking improved 
parental care with broader initiatives to improve the health of women regardless 
of pregnancy status (Wise et al 1995). 

Unfortunately, trends 1n the social, economic, and health status of women of 
childbearing age in the United States are not encouraging. The proportion of 
children living in poverty and of single-headed female households has increased 
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over past decades. The proportion of children living in poverty in the United 
States declined during the 1960s and then increased during the 1970s and 1980s 
(Select Committee on Youth and Families 1989). Rates of children living in 
poverty in the United States are higher than those of the other industrialized 
countries (Smeeding & Torrey 1988). 

A variety of health characteristics among women indicate a risk for poor 
pregnancy outcome, including elevated blood pressure, older age at childbirth 
(due mainly to improved fertility treatment—associated with increased risks of 
multiple births and prematurity), cigarette smoking, infections, sexually trans- 
mitted diseases, and alcohol and drug use. A review of trends in these factors 
over the past decades indicates no substantial evidence for improvement in 
women’s health status (Gortmaker & Wise 1994). These issues may be particu- 
larly important for poor and minority women in the near future, in light of recent 
“welfare reform” legislation in the United States that will reduce future fed- 
eral spending on poor households. One hypothesis concerning maternal health 
status is that optimal maternal health for childbearing may occur at different 
ages for African-American women, compared to non-Hispanic white women, 
in the United States, because health status declines with age among minority 
women due to the continuing impact of difficult socioeconomic environments 
(Geronimus 1992). 

In light of these facts, reframing the infant mortality problem as an issue 
of women's health suggests that broader initiatives to address the health and 
well-being of women are essential. A focus on improving the health status of 
all women in the United States will require more than the provision of a right 
to health services. Changes include legal protection under the law, the right to 
choose abortion, support for work, maternity leave, and child care. Attention to 
the provision of cost-effective health promotion programs, including nutrition, 
smoking cessation, drug addiction treatment, and other services that typically 
fall outside the health care system. 

While there is some evidence for intergenerational effects of deprivation 
(Emanuel 1986), the strongest evidence is that a woman's health prior to preg- 
nancy is a substantial predictor of outcome and therefore should be the focus 
of research and intervention. In many ways the continuing focus on prenatal 
care as a preventive strategy epitomizes a strategy that comes too late to make a 
large impact on birth outcomes such as low and very low birth weight (Kempe 
et al 1992). 


Race, Ethnicity, and Infant Mortality 

We have described the persistent racial and ethnic disparities in infant mortal- 
ity rates in the United States. Despite much speculation, no evidence suggests 
that social, economic, or racial/ethnic disparities in the risk of death in infancy 
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are caused by genetic differences. Any discussion of racial and ethnic differ- 
ences in birth outcomes needs to acknowledge the limited extent to which social 
and economic variables—including racial discrimination—can be disentangled 
from crude proxies for biologically determined pathologic pathways (Krieger 
et al 1993, Herman 1996). The racial and ethnic categorizations used in most 
research on infant mortality in the United States should be seen as “proxies 
for specific historical experiences and a powerful marker for current social and 
economic conditions" (Williams & Collins 1995). Because of the unclear defi- 
nition and measurement of "race" and "ethnicity" (Jones et al 1991) in virtually 
all large studies of infant outcomes and the confounding with a multitude of 
environmental differences, the potential causal role of "racial" and "ethnic" 
differences in the generation of disparate infant mortality rates is an issue that 
may be poorly served by current analytic approaches. 

The pervasive effects of racist attitudes and beliefs and the legacy of slavery 
in the United States can often be difficult to measure, but the historical evidence 
is quite clear in documenting the profound influence of these forces upon the 
life chances of minorities (Krieger et al 1993, Williams 1996). It is important 
to remember that the racial differences noted for infant mortality are similar to 
differences documented for virtually all causes of mortality in the United States. 

We have noted continuing black/white disparities in infant mortality in the 
United States. Other studies document the elevated rates of infant mortal- 
ity among Native Americans (VanLandingham & Hogue 1994) and Hispanics 
(Bacerra et al 1991), and the important errors and inconsistencies in coding 
racial and ethnic categories that occur (Hahn et al 1992). 

The roots of biologically determined explanations for disparities in infant 
mortality lie in both technical and political arenas. For example, technical ar- 
guments concerning black/white differences are based on evidence indicating 
that blacks experience a lower mean birth weight than do whites, while at 
the same time blacks experience lower birth weight-specific infant mortality 
among small infants (Buehler et al 1987). Some analysts have argued that 
a lower mean birth weight among blacks should require an adjustment when 
compared with white birth-weight distributions, thus reducing differences in 
low birth-weight rates (Wilcox & Russel 1983). This argument, while useful 
in some settings, has been widely misinterpreted as suggesting that the racial 
difference in mortality should similarly be adjusted and, thereby, technically 
reduced as well. This is flawed in two respects. First, the focus on mean birth 
weights ignores the fact that mean differences are not closely related to risk; 
rather, it is extreme birth weights (particularly extremely low birth weight) that 
are most closely related to the risk of infant death. Furthermore, black infant 
mortality is higher than white infant mortality because of the frequency of ex- 
tremely low birth weight and associated prematurity (Rowley 1995). Itis useful 
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to remember here that the mean birth weight of blacks in the United States is 
similar to that for the Japanese, the population with the lowest infant mortality 
rate in the world; extreme low birth weight in Japan is relatively rare. 

Second, this interpretation does not recognize that any adjustment in birth- 
weight distribution that reduces racial differences requires a compensatory ad- 
justment in birth weight-specific mortality that increases racial differences. 
Adjustment for mean birth weight, therefore, can never reduce the actual excess 
mortality that occurs in blacks. The political utility of “biologically” determined 
racial disparities in infant mortality is derived from its potential exculpatory 
character in assessing the role of social and policy-based influences. To the 
extent that “biology” can be said to determine the scale of racial disparities in 
infant mortality, the role of social factors and health care provision is to that 
same extent removed from practical consideration. Together, the technical and 
political aspects of this issue have generated considerable confusion concerning 
how the social epidemiology of racial disparities in infant mortality has been 
understood in the deliberation of ameliorative public policies. 

Because of a long-standing interest in the complexities of race, ethnicity, 
and social stratification in the United States, sociologists can contribute signifi- 
cantly to a more accurate understanding of racial and ethnic inequities in infant 
outcomes. The conceptual and daily dimensions of racial and ethnic differences, 
including discrimination and racism, their definition and measurement, and how 
risks and interventions can operate within different populations are all issues 
that demand greater study. The analysis of racism (Williams 1996), migration 
(Cabral et al 1990), and social and economic risks within populations (see e.g. 
Schoendorf et al 1992) and the outcomes of an expanding population of mixed 
racial and ethnic children (Collins & David 1993) constitute promising avenues 
of research. 


SUMMARY 


Current patterns of infant mortality in the United States provide a useful illus- 
tration of the dynamic interaction of underlying social forces and technological 
progress in determining trends in health outcomes. Sociological research con- 
cerning infant mortality in the future needs to acknowledge the power of both 
social and economic forces and health care technology in the generation of 
disparities in infant mortality. 
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ABSTRACT 

Sociologists have not addressed directly the normative issues that constitute the 
core of medical ethics as an intellectual discipline. They have, however, con- 
tributed to a “realist” critique of medical ethics in practice. In regard to the 1n- 
stitutionalization of “autonomy” around the principle of informed consent, they 
have noted widespread indifference on the part of patients, considerable variation 
among settings, and a persistent ability of physicians to deflect challenges to their 
authority. In regard to failed efforts to institutionalize “justice,” sociologists have 
noted both the difficulty physicians experience moving from individual clinical 
decisions u a recognition of the collective consequences of those decisions and 
the social structures that shape allocation decisions along dimensions orthogonal 
to ethical concerns. Most importantly, however, medical ethics is an arena in 
which sociologists can revisit, in new form, old issues about the doctor-patient 
relationship, the relaticnship between medicine and gender, the meaning of death 
and dying, and the character of medical professionalism. 


INTRODUCTION 


In the extended family of scholars that has formed over the last three decades 
around what has come to be known as medical ethics, sociology is a poor rela- 
tive. Medical ethics is an intellectual discipline, but sociology has not assumed 
a prominent position in that discipline, and medical ethicists themselves have 
typically paid little attention to sociology. Medical ethics is also a set of in- 
stitutionalized principles and procedures that have helped shape new modes of 
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medical practice. Sociologists have assumed a position as sympathetic critics 
of these new policies, but they are by no means their most important—let alone 
their only—critics. Nonetheless, medical ethics—both as an intellectual disci- 
pline and as a social movement—has provided sociologists an opportunity (or 
at least an excuse) to revitalize many of the long-standing preoccupations of 
medical sociology. 

In this essay, I briefly review the relationship between sociology and medical 
ethics as an intellectual discipline. I then turn to the sociological critique of the 
institutionalization of medical ethics, concentrating primarily on the successful 
institutionalization of procedures meant to ensure “autonomy” and secondarily 
on less successful attempts to institutionalize "justice." In the final section of 
the review, I suggest that the most important contributions of the sociology of 
medical ethics are neither to medical ethics as an intellectual discipline nor even 
as a mostly sympathetic critique of the institutionalization of medical ethics, 
but to the development of a distinctively sociological understanding of medical 
practice. 


Medical Ethics as an Intellectual Discipline 


No doubt, there have been self-conscious medical ethics at least since Hip- 
pocrates formulated the famous oath physicians still take on entering their 
profession. Even “modern” codes of medical ethics date from the eighteenth 
century in England and the middle of the nineteenth in the United States (Berlant 
1975). This said, contemporary medical ethics, in the sense of sustained reflec- 
tion on clinical decision-making based in systematic philosophy and theology, 
is probably no older than the 1954 publication of Joseph Fletcher’s Morals 
and Medicine, and it reached intellectual maturity only in the 1960s and 1970s 
with the publication of many of the field’s classic essays (Ramsey 1970, Veatch 
1976). 

Medical ethics is now a well-established discipline, with its own journals 
and institutes, although these are more often organized as centers with varying 
degrees of independence from conventional lines of university departments 
than as traditional academic units. Medical ethics has, however, been led far 
more by the parent disciplines of philosophy and theology than by sociology 
or any of the other social sciences. To some degree, sociologists and medical 
ethicists are competitors, fighting for often hotly contested space on the medical 
school curriculum, with medical ethics now displacing the social sciences. For 
the most part, however, sociology and medical ethics have treated each other 
with mutual indifference. Although a few medical ethicists have suggested 
that medical ethics would be revitalized by the inclusion of social science 
(Hoffmaster 1992, Crigger 1995, Peralman et al 1993), sociologists remain 
infrequent contributors to the leading journals of medical ethics. Even reviews 
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of sociological books directly related to ethical issues are rare in those journals, 
and those reviews that are published are often critical. Neither, however, do 
sociological journals review even the most important works of medical ethics, 
and many sociologists, including those who formulate their work as related to 
ethical issues, rarely even attempt to treat philosophical or theological issues 
on their own terms. 

The mutual indifference of medical ethicists and sociologists of medical 
ethics is, I would suggest, a matter of fundamental differences of purpose—of 
a contrast between medical ethics as a normative discipline and sociology as 
an empirical one. However, this claim needs qualification. Certainly med- 
ical ethics is more explicitly normative than sociology, and sociology, more 
explicitly empirical than medical ethics. Nonetheless, medical ethicists often 
incorporate empirical material into their analyses and recently have shown a 
growing inclination to do so. So too the sociology of medical ethics carries 
normative implications, often quite intentional, particularly (although not ex- 
clusively) in feminist discussions of medical practice (Ruzek 1979, Rothman 
1989). At most then, the difference between medical ethics and the sociology 
of medical ethics is in the form of their normative statements: Sociology is 
almost always consequentialist; in contrast, medical ethics is given much more 
to "deontological" positions, evaluating acts as right or wrong even apart from 
consequences. 

More important is the difference between medical ethics as an applied disci- 
pline and the sociology of medical ethics as an academic one. But this claim, 
too, requires qualification. Many textbooks and readers in medical ethics begin 
with discussions of general principles—utilitarianism, Kantian ethics, Rawlsian 
ethics—that require flights of abstraction unfamiliar to most sociologists (for 
example, Munson 1988). Conversely, sociologists of medical ethics seem pre- 
pared to grapple with the grubby details of hospitals, clinics, and patients’ lives 
in a way matched by few ethicists. Nonetheless, medical ethics is fairly explic- 
itly a branch of applied philosophy, and even its grandest abstractions are put to 
use in the service of formulating procedures and policies useful to problems of 
therapeutic practice, health care delivery, and medical and biological research" 
(Beauchamp & Childress 1979, vi). In contrast, sociologists tend to use even 
grubby details as a route to more analytic observations. Ironically, then, it is 
the medical ethicists, trained ın philosophy and theology and given to abstrac- 
tion, who are often worldly, and it is the sociologists of medical ethics who are 
more often other-worldly (Gray & Phillips 1995). Both medical ethicists (Arras 
1991, Kass 1990) and sociologists of medical ethics are prepared to analyze 
individual cases, but they do so for different purposes. For the sociologist, the 
case Is typically exemplary. In contrast, for the medical ethicist, like the clinical 
physician and, to some extent, the lawyer, the case is often the end point. Put 
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broadly, but with at least moderate accuracy, medical ethics may be thought 
of as the normative study of high principles for the purpose of guiding clinical 
decisions. In contrast, the sociology of medical ethics may be thought of as 
the empirical study of clinical decisions for the purpose of understanding the 
social structure of medicine. Clearly then, medical ethicists and sociologists of 
medical ethics travel much of the same terrain, but they do so traveling in dif- 
ferent directions. If they do little more than wave as they pass, rarely stopping 
even for a chat, it is because they do not have a great deal to say to each other. 


The Institutionalization of Medical Ethics: Informed Consent 


In a series of influential essays, Renee Fox (1989, 1990, Fox & Swazey 1984) 
has argued that the “ethos” of medical ethics has been dominated by an analytic 
individualism, organized around the value of “autonomy,” that assigns promi- 
nence to “the notion of contract" while relegating “more socially-oriented values 
... to a secondary status" (1989, pp. 229-30). Fox's observations do not seem 
to me an accurate characterization of medical ethics as an intellectual disci- 
pline. Like many disciplines, medical ethics is far from consensual, and values 
of “beneficence” and “justice” have always competed, at least in more con- 
ceptual writings, with “autonomy” (National Commission 1978). Moreover, 
what Fox (1989) herself, writing in the 1980s, had the foresight to recognize as 
“incipient changes in the ethos of bioethics” have assumed even more promi- 
nence as ethicists have come increasingly to stress issues of distributive justice 
in response to a growing perception of a crisis in health care resources and 
costs (Callahan 1987, Childress 1990). Nonetheless, whatever qualifications 
one might introduce to Fox’s characterization of medical ethics as an intellec- 
tual discipline, they represent a remarkably apt characterization of the ways in 
which medical ethics has been successfully institutionalized. 

There has, of course, always been a practice of medical ethics, both in the 
sense that the actions of physicians and other health care personnel carry ethical 
implications and in the sense that medical training has long carried a signifi- 
cant element of moral education (Bosk 1979). However, the last 30 years in 
the history of medical practice are distinguished by the degree to which ethi- 
cal considerations in medicine have been made explicit and subject to formal 
policies and procedures. 

The bases of contemporary, institutionalized medical ethics can be found in 
the "Nuremberg Code,” which, in response to the revelation of bizarre medi- 
cal experiments in Nazi concentration camps, enunciated the principle of the 
*voluntary consent" of human subjects as essential to the ethical conduct of 
research. However, as the historian David Rothman (1991) has argued, the 
Nazi abuses were taken more as a cautionary tale about government interfer- 
ence with science than as an indication of any parallel problems in American 
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medicine. The origin of contemporary medical ethics, then, is probably better 
dated to a series of well-publicized reports of abuses of the rights of dependent 
and vulnerable populations (poor, rural African-Americans, prisoners, soldiers, 
retarded children, and the debilitated aged) in the United States (Beecher 1966, 
Jones 1981). In responses to these revelations, particularly Beecher’s 1966 New 
England Journal of Medicine report of 22 cases of ethical abuses in medical 
research, the US Public Health Service promulgated a set of rules providing 
protection for human subjects and requiring the establishment of Institutional 
Review Boards (IRBs) to provide peer review of ethical issues in all federally 
funded research. 

Even had it been inclined to do so—which it was not—the Public Health 
Service could not have regulated clinical practice so easily as medical research. 
Nonetheless, consent also came to clinical practice, albeit more from the courts 
than from codes. To be sure, common law in the United States has long required 
the consent of patients to medical procedures. However, throughout most of the 
nineteenth century, this requirement was based primarily on the notion that the 
patient’s consent was essential to the effectiveness of procedures (Pernick 1982). 
Only in the twentieth century, particularly since the 1914 Schloendorff case, 
was the requirement of obtaining consent made a matter of rights, and only 
since the 1957 California Salgo case has there been any requirement that con- 
sent must also be “informed.” Since 1957, the doctrine of informed consent has 
been extended, both by courts and through legislation, to include an increas- 
ingly broad, “patient centered,” set of standards of what constitutes adequate 
information. Although there is significant variation from state to state, not only 
does the legal doctrine cf informed consent require physicians to disclose a wide 
range of information to patients—including the risks and benefits of procedures 
and alternatives to proposed treatments—but those requirements gain additional 
force from hospital policies that have routinized the process of obtaining consent 
(President’s Commission 1982, Katz 1984, Faden & Beauchamp 1986). 

Although based on similar principles, the development of policies bearing 
on termination of treatment has followed a more complex course, following 
somewhat different trajectories for newborns and for adults and involving not 
only both court cases and an ill-fated effort at federal regulation but also a long 
process in which hospitals themselves played a leading part in formulating pro- 
cedures. It had long been an ill-kept secret 1n medicine that physicians did not 
always make maximal efforts to maintain the life of either disabled newborns or 
critically ill adults. But the issue became public only with the 1973 publication 
of an article on “Moral and Ethical Dilemmas in the Special Care Nursery” in 
which Duff & Campbell described the procedures he and his colleagues used 
to consult with patients in deciding to withhold potentially life-sustaining treat- 
ment in a newborn nursery. (Duff is perhaps better known to sociologists for his 
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collaboration with August Hollingshead on their classic 1968 study, Sickness 
and Society.) Duff’s article signaled the beginning of a controversy that contin- 
ued through a series of so-called “Baby Doe” court cases, in which right-to-life 
advocates attempted to require surgery for a number of newborns against the 
wishes of both parents and physicians. These cases culminated in 1983, when 
the Reagan administration issued a set of regulations prohibiting discrimina- 
tion against newborns on the basis of developmental disability. Although the 
Reagan rules were eventually overturned in court, the result has been the devel- 
opment of carefully constructed guidelines that allow parents and physicians to 
decide against treating newborns, but typically only with the oversight of newly 
formed neonatal ethics committees (President’s Commission 1983). 

The development of procedures for withholding treatment from critically ill 
adults developed along a more or less parallel course, although with signifi- 
cantly less federal involvement. In 1976, a New Jersey court ruled that Karen 
Quinlan, a young woman believed to have become permanently comatose, 
could be removed from a respirator. In the same year, in a special article in 
the New England Journal of Medicine, Rabkin and his colleagues described the 
procedures used at Beth Israel Hospital to guide “the process by which deci- 
sions not to resuscitate should be made.” Since 1976, virtually every hospital 
in the country has followed Beth Israel’s lead, establishing policies for “Do 
Not Resuscitate” or DNR orders that assign priority in decision-making to the 
wishes of the patient. And in the 1990 Cruzan case, although in a finding fraught 
with ambiguities, the United States Supreme Court confirmed that physicians 
could withdraw treatment from terminally ill patients so long as—given appro- 
priate state-determined procedural safeguards—that decision is an accord with 
the patient’s desires. 

The process of institutionalization, then, has varied significantly in regard 
to experimentation with human subjects, the establishment of informed con- 
sent, withholding treatment from newborns, and withholding treatments from 
adults—with different forms of involvement by courts and the federal gov- 
ernment, with varying degrees of leadership from hospitals, and with varying 
amounts of controversy. However, in each instance, the institutionalization of 
policies has taken shape around the principles of patient autonomy and self- 
determination. In effect, “informed consent,” intended explicitly to increase 
the patient’s ability to direct decisions bearing on his or her own treatment, 
has become the master solution to ethical issues. Thus, the solution to abuses 
in medical research has been a set of procedures designed to ensure that the 
participation of human subjects is voluntary; the solution to dilemmas in the 
treatment of handicapped newborns has been to ensure that parents receive 
priority in making decisions; and the solution to dilemmas in the treatment of 
adults has been an attempt to find procedures ensuring that the patient’s own 
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wishes take priority. To be sure, many of the applications of informed consent 
are remarkably complex. How, for example, can one ensure that a “substi- 
tuted judgment” made on behalf of a comatose, no longer competent patient 
genuinely represents what that patient's wishes would have been in an unan- 
ticipated situation? How, for example, can one speak at all of the wishes of a 
newborn infant or a severely retarded adult who has never been competent? Yet, 
these complexities notwithstanding, the principle of informed consent and the 
underlying value of autonomy have themselves remained remarkably simple 
and straightforward. 

Sociologists have generally expressed sympathy for informed consent. They 
have, however, also expressed considerable skepticism about its implementa- 
tion. In summarizing a series of methodologically diverse studies, there is some 
danger of reporting greater consensus than, in fact, exists. Nonetheless, there 
does seem to be something of an emerging sociological critique of informed 
consent in practice. This critique, however, is not unique to sociology but is 
shared with a congeries of researchers so ill-defined as to lack even an agreed- 
upon name, but it includes physicians, nurses, and health policy scholars from a 
remarkably wide range cf disciplinary backgrounds. Roughly epidemiological 
in style, these researchers have joined with sociologists to make a case that: 
1. Although not matched in practice, there is a great deal of expressed support 
among physicians and other health care professionals for the principle of in- 
formed consent; 2. there is widespread indifference on the part of patients to 
the sort of participation envisioned by informed consent; 3. there are significant 
variations in practice among settings; and 4. physicians maintain significant 
authority over decisions, based primarily on claims to technical expertise. 

1. Atleastatthe level of principle, there is little resistance to informed consent 
among doctors, nurses, or other health care personnel. Perhaps the single 
most striking piece of evidence is a 1979 replication of a 1961 study of what 
physicians told cancer patients, showing a remarkable shift from near unanimity 
in not informing patients of a cancer diagnosis in 1961 to near unanimity in 
informing patients in 1979 (Oken 1961, Novack et al 1979). Similarly, both 
a Harris survey commissioned by the President's Commission on Biomedical 
Ethics (Louis Harris 1982) and a survey of matched samples of patients and 
physicians drawn from three Midwestern states (Haug & Lavin 1983) show 
widespread support among physicians for informing patients. 

Practice, however, is a different matter. The failures of 1nformed consent 
are probably best documented in regard to termination of treatment. There, a 
large body of literature based on surveys, chart review, and direct observation 
shows that patients themselves participate in probably somewhat less than one 
fifth of the decisions. Although surrogates, typically family members, par- 
ticipate more, those surrogates usually follow the lead of physicians who are 
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ill-equipped to predict the preferences of patients and are less inclined toward 
aggressive treatment than either patients or family members (Jayes et al 1993, 
Smedira et al 1990, Hanson et al 1994, Bedell & Delbanco 1984, Wren & 
Brody 1992, Dants et al 1991). Similar failures of informed consent have been 
reported in research settings (Gray 1975), adult intensive care units (Zussman 
1992), neonatal intensive care units (Guilleman & Holmstrom 1986, Anspach 
1993), and in both general medicine and general surgery (Lidz et al 1982, 1983, 
1984). As a result, the practice of informed consent becomes a “ritual” of pa- 
tient care (Bosk 1980, Lidz et al 1983, Zussman 1992) rather than the intended 
occasion for substantive discussion between doctor and patient. 

2. Despite the support for informed consent among physicians, there is wide- 
spread indifference on the part of patients. Read carefully, both the Harris and 
Haug surveys show that physicians are more inclined to inform patients than 
patients are to ask for information. More specifically, drawing on field research 
and patients’ accounts of their own illnesses, both Lidz and his colleagues 
(1982, 1983, 1984) and Schneider (1997) have shown that patients may want 
physicians to keep them informed as a matter of courtesy or to enable them 
better to comply with physicians’ recommendations. Only rarely are patients 
interested in full participation in decision-making processes. 

At the same time, clinical ethics often ignores those issues—even those moral 
issues—that most concern patients. As Schneider & Conrad (1983), Charmaz 
(1991), Kleinman (1988) and Frank (1995) have all argued, not only is informed 
consent at the periphery of patients’ experience but so, too, is patienthood more 
generally, especially for those with chronic illness inremission. As Frank (1995, 
p. 156) has suggested, “the ethical questions for members of the remission 
society are not adjudications of health care conflicts but how to live a good life 
while being ill" As a result, the kind of moral thinking that is embedded in 
the narratives of selfhood constructed by those with chronic illness is largely 
ignored in the institutionalization of medical ethics. 

3. There is considerable variation in the effectiveness of informed consent 
across settings. Informed consent has probably been most effective with hu- 
man subjects in a research setting, but even there variations are significant. 
The intense participation of patients and the close relationships between those 
patients and researchers, described by Fox (1959) in her pioneering study of an 
experimental ward prior to the institutionalization of informed consent, prob- 
ably represents a relatively rare situation. Stressing the more typical situation 
in which medical scientists leading large research teams are unconstrained by 
intense patient contact, Barber et al (1973) reported that a significant minority 
of researchers, driven by competition for recognition and reward, were at least 
prepared to ignore the mandates of conventional ethical standards. Gray (Gray 
1975, Gray et al 1978, Gray & Osterweis 1986) has also reported that in most 
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research settings informed consent is treated casually and that, in a research 
study of labor-inducing drugs, many women “mostly with low levels of edu- 
cation ... did not even realize they had been entered into a research project, 
notwithstanding their having signed a consent form" (1986, 547). 

If informed consent is unevenly effective in research settings, the failures are 
even greater in clinical settings. Lidz and his colleagues (1982, 1983, 1984), 
in particular, have argued that, as a legal doctrine, informed consent embodies 
a set of assumptions that are ill-suited to medical practice. Thus, the doctrine 
of informed consent assumes that “medical practice is discrete—that is, broken 
into distinct parts, or decision units—and that there can be consent by the patient 
to each of these individual parts" (Lidz et al 1982, p. 401). This assumption 
may make some sense in medical research, which may involve both explicit 
protocols and explicit points at which patients can decide whether to participate. 
In contrast, in clinical practice, especially in internal medicine, decision-making 
Is far less explicit and is incremental. Only in surgery, where procedures have 
clear starting points and where there is an explicit chain of command, does the 
practice of informed consent even approach a model of physicians disclosing 
information before patients “decide” what course to follow. 

4. The ability of physicians to claim authority based on their possession 
of specialized technical knowledge contributes to the ineffectiveness of in- 
formed consent. This claim is addressed both to physicians themselves, in a 
largely unself-conscious effort to reconcile expressed belief with practice, and 
to patients. In neonatal intensive care units, Anspach (1993), Guilleman & 
Holmstrom (1986), and Levin (1990) have all shown that physicians more of- 
ten ask parents to "assent to a decision members of the nursery have already 
made" than they treat parents as "the principal participants in the life-and-death 
decisions" (Anspach 1993, p. 87). Anspach argues that the "decision-making 
process itself is structured to limit and constrain parental participation" (92). 
Staff typically meet among themselves before meeting parents and then present 
a "united front,’ do not expose parents to the full range of treatment options, 
and frequently slant the prognosis depending on the course of action they have 
independently decided to follow. Moreover, staff "employ a set of strategies to 
persuade the parents to accept the statf's point of view" (96). The most com- 
mon of these is an appeal to expert authority which includes (a) reviewing test 
results (thereby appealing to the authority of technology); (b) noting that the 
baby has failed to improve; (c) assuring the parents that the nursery has done 
everything in its power to help the baby; [and] (d) letting the parents know that 
the baby is being kept alive by artificial means (97). 

The potential tension between the expressed principle of involving parents 
in decision-making and the actual practices is diffused, in part, by structural 
factors that limit physicians' contact with patients and, in part, by discounting 
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the views of “unsophisticated” parents. The result is that it is physicians, not 
parents, who are the leading decision makers. 

Zussman’s (1992) study of two adult intensive care units yielded similar 
results. As in neonatal units, physicians typically express a principled accep- 
tance of patients’ and families’ priority in matters of “values” as they bear on 
the general direction of treatment. However, physicians, in effect, neutralized 
the patient’s or family’s preferences by narrowing the range of decisions. By 
formulating decisions as technical rather than value-relevant issues, physicians 
demarcate decisions as within the proper realm of medical expertise. While 
neither the study of adult intensive care units nor studies of neonatal units deny 
that the doctrine of informed consent has empowered patients and their fami- 
lies, all show quite clearly that the empowerment of patients falls far short of 
expressed intentions. 

There is, then, a more or less standard sociological critique of informed 
consent. It 1s not, however, a philosophical critique of principles. Quite the 
reverse, there is an implicit sympathy for both informed consent and the un- 
derlying value of autonomy. Rather, it is a critique of the viability of informed 
consent, based in a structural analysis of the cultural authority of medicine and 
the social structures that prevent patients from participating in decisions bearing 
on their own care. 


Failures of Institutionalization: The Allocation 
of Scarce Resources 


Medical ethicists have probably written as much about "justice" as “autonomy.” 
To be sure, medical ethicists have tended—although with some recent signs of 
change—to treat broad questions of health care policy as beyond their scope, 
limiting themselves instead to questions of justice embedded in clinical de- 
cisions. But they developed no overarching, guiding principle equivalent to 
informed consent. The closest, perhaps, is a principle of medical utility (Knaus 
1983), based loosely on the military metaphor of battlefield triage (Teres 1989). 
But even here there is significant dissent (Annas 1985). For decisions about 
the allocation of dialysis, organ transplants, or intensive care beds, there are no 
court cases of impact equivalent to Quinlan and Cruzan. And, because of their 
more transparent relationship to policy and to competing interests, proposals 
to institutionalize methods of allocating scarce resources have found a very 
different audience from proposals for ensuring informed consent. [In a study 
of the impact of various national commissions on medical ethics, Gray (1995) 
found that those reports organized around informed consent are cited more of- 
ten in both medical journals and court cases than those reports addressed to 
questions of allocation.] However, the rapid growth of managed care, accom- 
panied by the potential conflicts such care raises between the financial interests 
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of physicians and the health interests of patients, has recently placed the allo- 
cation of resources high on the agenda of medical ethicists. On these matters, 
then, sociologists have assumed a very different stance than in regard to in- 
formed consent. Rather than showing the limited effect of new policies on how 
decisions are made, the task in regard to the allocation of scarce resources has 
been to show how decisions are made in the absence of explicit policies. 

There is not a single history of clinical decisions about the allocation of scarce 
resources so much as a series of discrete histories, each organized around a 
particular set of issues. Organ transplentation has proceeded in fits and starts, 
marked by frequent successes, occasional failures (the artificial heart, animal- 
to-human transplants), spurts of activity and clinical moratoria, massive public 
interest, and both the celebration and demonization of pioneers in the field. 
Dialysis, a life-sustaining treatment for those with what would otherwise be 
terminal kidney disease, flared briefly as a public controversy in the 1960s before 
the passage of special legislation making treatment available to essentially all 
who need it. The allocation of beds in intensive care units, both neonatal and 
adult, has never become a public issue, but with the cost of intensive care 
accounting for one percent of the nation’s GNP, it is perhaps the most pervasive 
site of allocation decisions. 

Striking as the differences are among dialysis, organ transplants, and inten- 
sive care units, sociologists have demonstrated a remarkable similarity in the 
inability of either physicians or advisory boards to impose limits on treatments. 
These points have been made particularly powerfully by Fox & Swazey (1978, 
1992) in a series of studies conducted over a span of more than a quarter century. 
In Spare Parts and, particularly, in The Courage to Fail, they show the difficulty 
both physicians and lay advisory boards experienced 1n maintaining consistent, 
let alone ethically rigorous, policies for denying either organ transplants or 
dialysis. In neonatal units, Guilleman & Holmstrom (1986), Anspach (1993), 
and Levin (1990) all report a strong bias in favor of aggressive treatment, even 
for high-risk infants, and to the neglect of what might be far more cost-effective 
prenatal care. Similarly, Zussman (1992) reports the difficulty physicians in 
adult intensive care units experience attempting to deny expensive treatment 
even to patients likely to be "tortured" by expensive treatments and who have 
little significant chance of surviving their hospital stay. Only a desire on the 
part of house staff to protect their colleagues from difficult admissions—a con- 
sideration driven by a sense of mutual obligation to fellow professionals rather 
than by any systematic criteria of "justice"—introduces any incentive to control 
the allocation of beds. 

Some part of this tendency to overtreatment 1s probably explained by a gen- 
eral tendency to aggressive treatment, deeply rooted in the culture of American 
medicine (Payer 1988). But an at least equally compelling explanation points to 
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the structural characteristics of American medicine: in particular, the sprawl- 
ing character of the American health care system which, whether in organ 
transplants or intensive care units, allows a great number of actors to share in 
decisions without assigning clear responsibility to any particular actor (Fox & 
Swazey 1992). In contrast to the “closed system" of British medicine, which 
operates within fixed budgets, as the philosopher Norman Daniels (1986) has 
pointed out, American physicians operate within an “open system” in which 
they have no assurance that savings incurred on one patient will remain within 
the health care system at all. Accustomed to concerning themselves with that 
which they can control, physicians concentrate on the individual case while 
neglecting collective consequences. 

As much as physicians ignore the collective consequences of their individ- 
ual clinical decisions, those consequences are real. In some instances, this 
involves decisions that taken individually are made on universalistic criteria 
but in aggregate are highly particularistic. In intensive care units (Zussman 
1992), to continue one example, house staff accept a notion of collective con- 
sequences and consistently weigh the needs of patients against each other in 
admission and discharge decisions; in contrast, private physicians consistently 
act as advocates, pressing to admit their own patients to the unit without regard 
to competing claims for beds. Although each group of physicians follows more 
or less rigorous ethical standards, the aggregate result is that wealthier patients 
with a private physician to act as an advocate are more likely to enter and remain 
in an intensive care unit than are poorer, ward patients represented by house 
staff. In other instances, the collective consequences become articulated in a 
political arena. Fox & Swazey (1978), to continue another example, argue that 
the 1972 amendment to the Social Security Act, making dialysis available on 
a virtually universal basis, was a humane response to a horror at the denial of 
life-prolonging treatment to identifiable individuals. But the consequence, they 
argue, as the cost of the dialysis program became evident, was to undermine the 
political viability of more extensive proposals for catastrophic health insurance. 

In contrast to the almost haphazard way in which allocation decisions have 
been made in other settings, the rise of prospective payment systems and man- 
aged care represent attempts to introduce systematic incentives—if not always 
systematic standards—for physicians to introduce resource considerations into 
their decision-making processes. Prospective payment refers to the effort to 
control spending by the establishment of fixed reimbursements for hospitals 
on the basis of diagnostic-related groupings (DRGs) rather than on the basis 
of costs incurred. Managed care is a general term referring to a wide variety 
of plans and organizations—including health maintenance organizations and 
preferred provider organizations—that provide some sort of review of physi- 
cians' decisions and, in some cases, financial incentives for physicians to limit 
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care. Prospective payment on the basis of DRGs applies to hospital reimburse- 
ment rather than to physician reumbursement. As a result, although prospective 
payment has received a great deal of attention from health policy analysts, it 
has excited much less attention among medical ethicists focused on developing 
appropriate criteria for clinical decisions. In effect, prospective payment puts 
the physician in the position of a defender of the patient against a hospital with 
new incentives to limit both length of stays and intensity of treatment. Man- 
aged care, however, is different. By introducing sometimes direct financial 
incentives for physicians to limit care, managed care threatens the “fiduciary 
ethic" (that is, the primary responsibility of the physician to their patient's well 
being), which is the longstanding basis of physicians' claims to trustworthiness 
(Gray 1993, Morreim 1991, Wolfe 1994, Rodwin 1993) 

Although there is not yet a great deal of sociological research on either 
DRGs or managed care, what research there is suggests that the effects of 
financial incentives, while powerful, are indirect. Gray (1993), for example, 
did not find a great deal of difference in the behavior of for-profit and not- 
for-profit organizations. Similarly, Hillman and his colleagues (1989) found 
that financial incentives are more powerful in group model HMOs than in 
independent physician associations, while Notman and his colleagues (1987) 
found that physicians reduced lengths of stay in response to the introduction of 
DRGs despite both a hostility to financial considerations and a lack of detailed 
knowledge of DRGs. Put somewhat differently, these findings suggest that 
physicians respond to financial incentives within a collective context rather 
than as income-maximizing individuals, that their responses are mediated by 
the networks of physicians in which they are embedded and possibly by long- 
term career contingencies. 

To summarize, three points, at least, have recurred across a wide range of 
issues: 1. a recognition of the difficulty physicians have moving beyond indi- 
vidual clinical decisions to an understanding of the collective consequences of 
those decisions; 2. a recognition, often omitted from the discourse of medical 
ethics, of the complex and powerful set of social relations that shape decisions 
when they are made; 3. an insistence that, despite the indifference of physicians, 
those consequences are both real and powerful, the collective consequences of 
their individual decisions. 


The Sociological Uses of Medical Ethics 


The critique of the institutionalization of medical ethics—both its successes 
and its failures—is distinctively sociological. But it is not unique to sociology, 
and it 1s a critique that responds to an agenda set more by medical ethicists, 
hospitals, and health professionals than by sociologists themselves. There is, 
however, another agenda that emerges much more clearly from sociologists 
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themselves. This is, in part, a matter of using ethical issues in medicine as the 
site for exploring general, theoretical issues in sociology. Chambliss (1996), 
for example, examines nurses' views of life and death decisions as an instance 
of the routinization of moral judgment, as an example of how organizations 
make the extraordinary seem ordinary. Similarly, Anspach (1987, 1993) exam- 
ines physicians’ and nurses’ differing responses to critically ill newborns in a 
neonatal intensive care units as an instance of the "ecology of knowledge,” of 
the ways in which differential locations within an organization produce selec- 
tive views of the same "reality." Only slightly farther afield, Bosk (1980) looks 
at the management of medical mistakes as an instance of responses to deviance 
more generally. And running through virtually all of the sociology of medical 
ethics are reflections on the social structure of occupations and professions. 

In invoking sociological theory, I do not mean, however, to invoke the tired 
distinction between sociology in medicine and sociology of medicine (Strauss 
1957). To be sure, there are significant differences between a sociology that 
follows the agenda of medicine (or medical ethics) and one that follows the 
theoretical agenda of its own discipline. But there is an alternative to either 
of these approaches: a sociology that looks at substantive issues in medicine but 
that does so in its own terms, with its own preoccupations, rather than following 
the lead of other disciplines. 

Most fundamentally, the rise of institutionalized medical ethics has trans- 
formed the doctor-patient relationship. To be sure, the effects of efforts to 
empower patients seem limited at best: the ability of physicians to neutral- 
ize the newly won rights of patients through appeals to technical expertise are 
simply a specific instance, under conditions of unusually explicit challenge, of 
the more general process by which physicians dominate medical encounters 
(Waitzkin 1982, Fisher 1986). At the same time, however, the self-conscious 
application of law and policy to what was once a more private encounter makes 
clear how difficult it is to understand that relationship in terms of an ahistorical 
functionalism (Parsons 1951, Szasz & Hollander 1956). More specifically, the 
entrance of medical ethics into the doctor-patient relationship seems to be ac- 
celerating a movement to a more contractual model in which doctor and patient 
meet with a more limited set of obligations than was once the case (Zussman 
1992, Fox & Swazey 1993, Zerubavel 1980). 

The ethical issues accompanying the development of new reproductive tech- 
nologies have also become a critical point for redefining the relationship, both 
substantive and symbolic, between women and medicine. Despite their frequent 
appearance in highly publicized court cases and in the quieter efforts of hospi- 
tals to develop protocols for their use, it is still far from clear how American 
medicine will treat genetic counseling, in vitro fertilization, and surrogate moth- 
erhood (Bosk 1992, Kolker & Burke 1994, Rothman 1986)—or how American 


MEDICAL ETHICS AND DECISION-MAKING 185 


women will respond to that treatment. Nonetheless, it is clear that each will 
become an arena in which the meaning of motherhood and the value of children 
will be fought out. Moreover, all suggest a remedicalization of pregnancy at 
precisely the moment that other trends are pushing toward its demedicalization. 

So, too, medical ethics is the site at which a transformation in the meaning 
of death seems to be in process. The development of explicit procedures for 
terminating treatment seems to have ended the type of blatant discrimination 
between rich and poor, African-American and European-American, once re- 
ported by Sudnow (1967) as a pervasive feature of emergency room medicine 
(Crane 1975). More generally, though, DNR protocols have become the site 
for a redefinition of death as a loss of consciousness rather than a loss of phys- 
iological functioning. And, more generally yet, although death is surely still 
far from an integral part of hospital life, the very establishment of protocols 
for terminating treatment makes death more visible and thus helps undermines 
the extensive efforts hospitals once made to hide death by isolating the dying 
patient both psychically and physically (Glaser & Strauss 1965, Aries 1981). 

Not least, the institutionalization of medical ethics is helping to transform 
the relationship among practitioners. If, as Freidson (1975) could once report, 
collegial oversight in medicine was both rare and exercised lightly, this is no 
longer so clearly the case. Although the rise of managed care is almost surely 
the major factor accounting for the extension of collegial control, the institu- 
tionalization of medical ethics provides a language in which this control may 
be put and a series of forums in which to do so. 

Nowhere, though, are the sociological uses of medical ethics more apparent 
than in the analysis of nursing as an occupation. The rise of medical ethics has 
probably contributed less to a renewed scciological interest in nurses than has 
a feminist interest in a predominantly female occupation and perhaps a gen- 
eral tropism in sociology toward those without power (Anspach 1993, Brannon 
1994, Chambliss 1996, Fisher 1995, Marrone 1995, Wolfe 1988, Fox et al 
1990, Melosh 1982). Moreover, because nurses lack authority, they are less 
relevant to ethical decisions than are the physicians in whose shadows they 
work. If the point is simply to evaluate the effectiveness of institutionalized 
ethics, nurses are, for the most part, beside the point. But this is not the point. 
Rather, nurses' responses to ethical issues are interesting to sociologists pre- 
cisely because nurses represent a different orientation to medical care than do 
physicians. In particular, they represent not only a less technical orientation to 
medical care but also what might be called an “ethics of care.’ Where physicians 
rely heavily—and, at times, almost exclusively—on diagnostic technologies, 
nurses represent a prognostic mode that draws more on the patient's participa- 
tion in the type of interaction that can be observed or intuited only in the context 
of an ongoing relationship. While physicians' communication with patients is 
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often oriented toward transmitting information or making specific decisions, 
nurses represent a style of communication with patients that is oriented more 
toward encouraging understanding. Where physicians are preoccupied with 
the process of cure and often indifferent to the patient’s experience during that 
process, nurses represent a concern for care, particularly with the management 
of pain, emotional as well as physical. In all these senses, nurses represent 
an alternative to the kind of contractual model implicit in contemporary med- 
ical ethics. To be sure, sociologists have demonstrated uncertainty as to the 
reality of this ethos and even considerable ambivalence as to its desirability. 
Nonetheless, they have consistently used the discussion of an ethos of care as a 
means of raising issues that are more or less systematically ignored in medical 
ethicists. They have, in short, used nurses as a means of reaching beyond a 
medical ethics that is limited to specific, delimited decisions and toward what 
might be thought of as a medical morality that permeates the everyday as well 
as the unusual. 

The sociology of medical ethics is not the same as medical ethics: It is 
more empirical, less normative, and, most importantly, sets its scope beyond 
an examination of the clinical decisions that preoccupy conventional medical 
ethicists. Much of the sociology directed most explicitly to medical ethics has 
contributed to a sympathetic critique of institutionalized procedures and poli- 
cies meant to ensure ethical standards in clinical decisions. But the sociology 
of medical ethics, I have suggested, is at its best when it goes beyond clinical 
decisions to an examination of the structure of American medicine. I would ar- 
gue, then, that the sociology of medical ethics is not—and should not be—about 
medical ethics so much as about the social organization of medicine more gen- 
erally. Sociologists have long noted a remarkable cultural authority as one of 
the distinguishing marks of the medical profession (Freidson 1970, Starr 1982, 
Larson 1977, Conrad & Schneider 1980). The rise of institutionalized medical 
ethics—no matter how limited in practice—is both a symptom and source of a 
waning of that authority. Medical ethics, then, is new arena in which to make 
sense of new developments in old issues. This is, in part, a matter of reclaiming 
issues that have been formulated in narrowly ethical terms for a broader socio- 
logical agenda. But it 1s also, in part, a matter of treating the institutionalization 
of medical ethics as a source for an ongoing social transformation of American 
medicine. If the task of the sociology of medicine is to make sense of the 
doctor-patient relationship, the relationship between medicine and gender, the 
meaning of death and dying, and the character of medical professionalism, it 
cannot do so without paying attention to medical ethics. 


Visit the Annual Reviews home page at 
http://www.annurev.org. 


MEDICAL ETHICS AND DECISION-MAKING 


Literature Cited 


Annas GJ 1985. The prostitute, the playboy, 
and the poet rationing schemes for organ 
transplantation Am J Public Health 75:187- 
89 


Anspach R 1987. Prognostic conflict in life- 
and-death decisions. the organization as an 
ecology of knowledge. J Hecith Soc. Behav 
28 215-31 

Anspach R. 1993 Deciding Who Lives. Fate- 
ful Choices in the Intenstve-Care Nursery. 
Berkeley: Univ. Calif. Press 

Anes P 1981. The Hour of Our Death New 
York Knopf 

Arras JD 1991. Getting down to cases’ the re- 
vival of casusstry in bioethics. J Med. Philos 
16:29-51 

Barber B, Lally J, Makarushka JL, Sullivan D. 
1973. Research on Human Subjects. Prob- 
lems of Social Control in Medical Exper- 
imentation New York. Russell Sage 263 


pp 
Beauchamp T, Childress JF 1979. Principles of 
Biomedical Ethics. New York: Oxford Univ. 


Press 

Bedel SE, Delbanco TL 1984 Choices about 

cardiopulmonary resuscitation in the hospi- 
do physicians talk with patients? 
N. Engl, J. Med. 310 1089-953 

Beecher HK 1966. Ethics and clinical research. 
N. Engl. J. Med. 174:1354—60 

Berlant JL. 1975. Professional Monopoly. A 
Study of Medicine in the United States and 
Great Britain Berkeley Univ. Calif Press 

Bosk CL. 1979 Forgive and Remember: Man- 
aging Medical Failure. Chicago. | Univ. 

cago Press 

Bosk CL 1980 Occupational rituals in pa- 
tient management. N Engl. J. Med. 303 7i- 
76 

Bosk CL. 1992. All God's Mistakes: Genetic 
Counselling in a Pediatric Hospital. Chicago: 
Univ. Chicago Press. 195 pp. 

Brannon RL. 1994 Intensifying Care. The Hos- 
pital Industry, Professionalization, and the 
Reorganization of the Nursing Labor Pro- 
cess Amutyville, NY: Baywood. 185 pp 

Callahan D. 1987. Setting Iarsts: Medical 
Goals in an Aging Society. New York: Sı- 
mon & Schuster. 256 pp. 

Chambliss DF. 1996. Beyond Caring. Hospi- 
tals, Nurses and the Social Organization of 
Ethics. Chicago. Univ. Chicago Press 

Charmaz K 1991 Good Days, Bad Days. 
The Self and Time in Chroniz Illness. New 
Brunswick, NJ’ Rutgers Univ. Press. 311 


p 
Childress J 1990 The place of autonomy in 
bioethics. Hastings Cent Rep 20:12-17 


187 


Conrad P, Schneider J 1980. Deviance and 
Medicalizaton St Louis: Mosby 

Crane D. 1975 The Sanctty of Social Life. New 
York: Russell Sage 

Cngger BJ 1995 Bioethnography: fieldwork 
in the lands of medical ethics Med. Anthro- 
pol Q. 9.400-17 

Damels N 1986 Why saying no to patients in 
the United States is so hard, N Engl. J. Med. 
314:1380—83 

Danis M, Southerland LI, Garrett J, Smith JL, 
Hielema F, et al 1991 A prospective study 
of advance directives for life-sustaining care 
N. Engl. J. Med 324:882-88 

Duff R, Hollingshead A. 1968 Sickness and 
Society New York: H & Row 

Duff RS, Campbell AGM 1973. Moral and 
ethical dilemmas in the special care nursery 
N Engl J Med. 289 890-94 

Faden RR, Beauchamp TL 1986. A History and 
Theory of Informed Consent. New York: Ox- 
ford Univ Press 

Fisher S. 1986. In The Patient's Best Interest: 
Women and the Politics of Medical Decisions. 
New Brunswick, NJ. Rutgers Univ Press 
214 pp 

Fisher S 1995 Nursing Wounds: Nurse Practi- 
toners, Doctors, Women Patients and the Ne- 
gotiation of Meaning. New Brunswick, NI: 
Rutgers Univ. Press. 259 pp. 

Fletcher J. 1954. Morals and Medicine. Prince- 
ton, NJ: Princeton Univ. Press 

Fox R 1959 Experiment Perilous. Glencoe, IL: 
Free Press 

Fox R 1989. The Sociology of Medicine En- 
glewood Cliffs, NJ. Prentice Hall 

Fox RC 1990. The evolution of American 
bioethics’ a sociological perspective. In So- 
cial Science Perspectives on Medical Ethics, 
ed. G Weiss, pp. 201-17. Dordrecht: Kluwer 

Fox RC, Aiken LH, Messikomer CM. 1990. The 
culture of caring: AIDS and the nursing pro- 
fession Milbank Q. 68:226—56 

Fox RC, Swazey JP 1978 The Coura 
Chicago’ Univ. Chicago Press. 2n 

Fox RC, Swazey JP. 1984. Medical morality 
is not bioethics—rmedical ethics in China 
and the United States. Perspect Biol. Med. 
27:336—-60 

Fox RC, Swazey JP 1992. Spare Parts: Organ 
Replacement in American Society, New York 
Oxford Univ. Press. 254 pp. 

Frank AW 1995 The Wounded Storyteller: 
Body, Illness, and Ethics Chicago: Univ, 
Chicago Press 213 pp 

Freidson E. 1970 Profession of Medicine New 
York Harper & Row 

Freidson E. 1975. Doctoring Together: A Study 


ded e to Fail 


188 ZUSSMAN 


of Professional Social Control. Chicago: 
Univ. Chicago Press 

Glaser B, Strauss A. 1965. Awareness of Dying. 
Chicago: Aldine 

Gray BH. 1975. Human Subjects in Medical 
Experimentation’ A Sociological Study of 
the Conduct and Regulation of Clinical Re- 
search. New York: Wiley Intersci. 

Gray BH. 1993. The Profit Motive and Pa- 
tient Care: The Changing Accountability of 
Doctors and Hospitals, Cambndge: Harvard 
Univ. Press 

Gray BH. 1995. Bioethics commissions: What 
can we learn from past successes and fail- 
ures? In Society’s Choices. Social and Eth- 
ical Decision Making in Medicine, ed. RE 
Bulger, EM Bobby, hv Fineberg, pp. 261— 
ra) Washington, DC: Natl. Acad. Press. Inst 


Gray BH, Cooke RA, Tannenbaum AS 1978. 
Research involving human subjects. Science 
201:1094—101 

Gray BH, Osterweis M. 1986. Ethical issues in a 
social context. In Applications of Social Sci- 
ence to Clinical Medicine and Health Policy, 
ed. LG Aiken, D Mechanic, pp. 543-64. New 
Brunswick, NJ: Rutgers Univ. Press 

Gray BH, Phillips SR. 1995 Medical sociol- 
ogy and health policy: Where are the con- 
nections? JHSB (Extra Issue): 170-82 

Guilleman JH, Holmstrom LL 1986. Mixed 
Blessings: Intensive Care for Newborns. 
New York: Oxford Univ Press 

Hanson LC, Danis M, Mutran E, Keenan N 
1994. Impact of patient incompetence on 
decisions to use or withhold life-sustaming 
treatment. Am. J. Med. 97.235-41 

Harris L, & Assoctates. 1982. Views of in- 
formed consent and decisionmaking. par- 
allel surveys of physicians and the public 
In President's Commission for the Study of 
Ethical Problems in Medicine and Biomedi- 
cal and Behavioral Research, Making Health 
Care Decisions Vol. 2. Appendices, Empin- 
cal Studies of Informed Consent, pp. 17-314. 
Washington, DC: USGPO 

Haug MR, Lavin B. 1983. Consumerism in 

edicine Beverly Hilis, CA: Sage 

Hillman AL, Pauly MV, Kersten J 1989, 
How do financial incentives affect physi- 
clans' chinical decisions and the financial per- 
formance of health maintenance - 
tons? N Engl J. Med. 321:86-92 

Hoffmaster B. 1992. Can ethn hy save 
the life of medical ethics? Soc Sci Med 
35°1421-31 

Jayes RL, Zimmerman JE, Wagner DP, Draper 
BA, Knaus WA. 1993. Do-not-resuscitate or- 
ders in intensive care units current prac- 
tices and recent changes. J. Am. Med. Assoc 
270 2213-17 


Jones JH. 1981. Bad Blood: The Tuskegee 
Syplulis Experiment New York Free Press. 
272 pp. 

Kass LR 1990. Practicing ethics: Where's the 
action? Hastings Cent Rep. 20:5—12 

Katz J. 1984. The Silent World of Doctor and 
Patient New York Free Press 

Kleinman A. 1988. The Iliness Narratives. Suf- 
fering, Healing, and the Human Condition. 
New York: Basic Books 

Knaus W, Draper E, Wagner D. 1983. The use 
of intensive care: new research initiatives and 
their implications for national health policy. 
Milbank Mem Fund Q. 61:561—83 

Kolker A, Burke BM 1994, Prenatal Testing: 
A Sociological Perspective. Westport, CT. 
Bergin & Garvey. 221 pp. 

Larson MS. 1977. The Rise of Professionalism 
Berkeley: Univ. Calif. Press 

Levin BW. 1990. International perspectrves on 
treatment choice 1n neonatal nurseries. Soc. 
Sci Med 30 901-12 

Lidz CW, Meisel A, Holden JL, Marx JH, 
Munetz M. 1982 Informed consent and the 
structure of medical care. In President's 
Commission for the Study of Ethical Prob- 
lems in Medicine and Biomedical and Behav- 
ioral Research, Making Health Care Deci- 
sions Vol. 2. Appendices, Empirical Studies 
of Informed Consent, pp. 317—410. Washing- 
ton, DC: USGPO 

Lidz CW, Meisel A, Osterweis M, Holden JL, 
Marx JH, Munetz M. 1983 Barriers to in- 
ja consent. Ann. Intern. Med. 99 539- 

Lidz CW, Meisel A, Zerubavel E, Carter M, Ses- 
tak M, RothL 1984. Informed Consent New 
York. Guilford 

Marrone C. 1995 Home again: professional 
and patient responsibilities in home health 
care nursing. PhD Diss., Dep. Sociol., State 
Univ. New York-Stony Brook Unpublished 

Melosh B. 1982 The Physician's Hand 
Philadelphia. Temple Univ. Press 

Morreum EH 199]. ing Act: The New 
Medical Ethics of Medicine’s New Eco- 
nomics Boston. Kluwer Acad. 

Munson R 1988 Intervention and Reflection 
Basic [ssues in Medical Ethics. Belmont, CA. 
Wadsworth 3rd ed. 602 pp 

National Commission for the Protection of Hu- 
man Subjects of Biomedical and Behavioral 
Research. 1978 The Belmont Report: Ethi- 
cal Principles and Guidelines for the Protec- 
tion of Human Subjects in Research. Wash- 
ington, DC. USGPO 

Notman M, Howe KR, Rittenberg W, Bndgham 
R, Holmes MR, Rovner DR 1987 Social pol- 
icy and professional self-interest. physician 
responses to DRGs Soc Sci. Med 25:1259— 
67 


MEDICAL ETHICS AND DECISION-MAKING 


Novack DH, Plumer R, Smith RL, Ochitill H, 
Morrow GD, Bennett JM. 1979. Changes ın 
physicians’ attitudes toward telling the can- 
cer patient. JAMA 241:897—900 

Oken D 1961. What to tell cancer patents. 
JAMA 175:1120~28 

Parsons T 1951. The Social System. Glencoe, 
IL: Free Press 

Payer L. 1988 Medicine and Culture. New 
York: Penguin 

Peralman RA, Miles SH, Arnold RM 1993 
Contributions of empirical research to medi- 
cal ethics Theor. Med. 14:197—210 

Pernick MS 1982. The patient's role 1n med- 
ical decisionmaking: a social history of in- 
formed consent ın medical therapy. In Pres- 
ident's Commission for the Study of Ethical 
Problems in Medicine and Biomedical and 
Behavioral Research. 1982 Making Health 
Care Decisions Vol. 3. Appendices, Studies 
on the Foundations of Informed Consent, pp 
1-35. Washington, DC. USGPO 

President's Commission for the Study of Ethical 
Problems 1n Medicine and Biomedical and 
Behavioral Research. 1982 Making Health 
Care Decisions. Vol 1 Report The Ethi- 
cal and Legal Implications of Informed Con- 
sent in the Patient-Practitioner Relationship 
Washington, DC USGPO 

President's Commission for the Study of Eth- 
ical Problems in Medicine and Biomedical 
and Behavioral Research. 1983 Senously 
1i newborns. In Deciding to Forego Lafe- 
Sustairing Treatment. Ethical, Medical and 
Legal Issues in. Treatment Decisions, pp 
197-230 Washington, DC: USGPO 

Rabkin MT, Gillerman G, Rice NR. 1976. Or- 
ders not to resuscitate. N. Engl. J Med. 
295.364—66 

Ramsey P. 1970 The Patient as Person. New 
Haven: Yale Univ. Press 

Rodwin MA. 1993. Medicine, Money, and 
Morals: Physicians’ Conflicts of Interest. 
New York: Oxford Univ. Press 

Rothman BK. 1986. The Tentative Pregnancy. 
New York: Viking 

Rothman BK. 1989 Recreating Motherhood. 
Ideology and Technology in a Patriarchal So- 
ciety New York: Norton 

Rothman D. 1991. Strangers at the Bedside. 
New York: Basic Books 


189 


Ruzek SB. 1979. The Women's Health Move- 
ment: Feminist Alternatives to Medical Con- 
trol New York: Praeger. 350 pp 

Schneider CE. 1997. Patients, Doctors, and De- 
cisions. Rethinking Autonomy New York. 
Oxford Univ. Press In press 

Schneider JW, Conrad P 1983. Having 
Epilepsy: The Experience and Control of Iil- 
ness Philadelphia: Temple Univ. Press 280 


pp. 

Smedira NG, Evans BH, Grais LS, Cohen NH, 
Lo B, et al. 1990. Withholding and with- 
drawal of life support from the cntically ill. 
N Engl. J. Med. 322:309—15 

Star P 1982. The Social Transformation of 
American Medicine. New York: Basic Books 

Strauss R. 1957 The nature and status of 
TERE, sociology. Am Sociol. Rev. 22:200- 

Sudnow D 1967. Passing On: The Social Or- 
ganızation of Dying. Englewood Cliffs, NJ. 
Prentice-Hall 

Szasz T, Hollander M 1956 Thomas Szasz 
and Marc Hollander basic models of 
the doctor-patient relationship Arch Intern. 
Med. 97:585-92 

Teres D 1989 Triage: an everyday occurrence 
in the intensive care unit. In Rationing of 
Medical Care for the Critically Ill, ed. MA 
Strosberg, IA Fein, JD Carroll, pp. 70-75. 
Washington, DC: Brookings Inst. 

Veatch R 1976. Death, Dying and the Biologi- 
cal Revolution, New Haven: Yale Univ. Press 

Waitzkin H. 1987. The Politics of Medical En- 
counters: How Patients and Doctors Deal 
with Social Problems. New Haven: Yale 
Univ. Press. 311 pp. 

Wolf ZR 1988. Nurse's Work, the Sacred and 
the Profane. Philadelphia. Univ Penn. Press 

Wolfe SM, 1994. Health care reform and the fu- 
ture of physician ethics. Hastings Cent. Rep. 
24.28-41 

Wren K, Brody SL 1992. The do-not- 
resuscitate order in the emergency depart- 
ment Am. J Med. 92:129--33 

Zerubavel E. 1980 The bureaucratization of re- 
sponsibility the case of informed consent. 
Bull. Am. Acad Psychiatr. Law 8:161—67 

Zussman R. 1992. Intensive Care: Medical 
Ethics and the Medical Profession. Chicago: 
Univ. Chicago Press 


Annu Rev Sociol 1997 23 191—214 
Copynight (C) 1997 by Annual Reviews Inc All rights reserved 


SOCIOLOGICAL RATIONAL 
CHOICE THEORY 


Michael Hechter 
Department of Sociology, University of Arizona, Tucson, Arizona 85721; 
e-mail: hechter@u.arizona.edu 


Satoshi Kanazawa 
Department of Sociology, Uris Hall, Cornell University, Ithaca, 
New York 14853-7601; e-mail: Satoshi.Kanazawa G cornell.edu 


KEY WORDS.  macrosociology, micro-macro link, general theory, empirical research 


ABSTRACT 

Although rational choice theory has made considerable advances in other social 
sciences, its progress in sociology has been limited. Some sociologists’ reser- 
vations about rational choice arise from a misunderstanding of the theory. The 
first part of this essay therefore introduces rational choice as a general theoretical 
perspective, or family of theories, which explains social outcomes by construct- 
ing models of individual action and social context. “Thin” models of individual 
action are mute about actors' motivations, while "thick" models specify them ex 
ante. Other sociologists' reservations, however, stem from doubts about the em- 
pirical adequacy of rational choice explanations. To this end, the bulk of the essay 
reviews a sample of recent studies that provide empirical support for particular 
rational choice explanations in a broad spectrum of substantive areas in sociol- 
ogy. Particular attention is paid to studies on the family, gender, and religion, for 
these subareas often are considered least amenable to understanding in terms of 
rational choice logic. 


INTRODUCTION 


In the last decade rational choice theory has gained influence and visibility in 
many of the social sciences and in related disciplines such as philosophy and 
law. To appreciate just how rapidly its influence has spread, consider political 
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science, a discipline in many respects similar to sociology. According to one 
estimate, the proportion of articles based on rational choice premises published 
in the American Political Science Review rose from zero in 1957 to nearly 
40% in 1992 (Green & Shapiro 1994, p. 3). Job candidates specializing in 
rational choice now command a hefty premium in American political science 
departments. 

Given its own disciplinary history, which is often characterized in relentlessly 
holistic terms, sociology would appear to be a most unpromising terrain for the 
spread of rational choice ideas and methods. Indeed, resistance to the approach 
has been notable (Baron & Hannan 1994, Petersen 1994). No American soci- 
ology department currently specializes in rational choice; only one even offers 
its graduate students a concentration in it. No jobs are listed for specialists in 
the area. This is in contrast to the situation in Western Europe, particularly in 
the Netherlands, Germany, and Sweden, where rational choice is considerably 
more institutionalized. 

Yet many sociologists, like the character in Moliére’s Bourgeois Gentil- 
homme who was startled to learn that he was speaking prose, unwittingly rely 
on rational choice mechanisms in their own research. Signs of the growing 
acceptance of sociological versions of rational choice can be found in the es- 
tablishment of the journal Rationality and Society, and in the regular proceed- 
ings of lively sections devoted to the approach in both the American and the 
International Sociological Associations meetings. 

Some of the skepticism about rational choice among sociologists arises from 
misunderstanding. One criticism of rational choice focuses on the lack of 
realism in its assumption that we calculate the expected consequences of our 
options and choose the best of them. A vast body of social research reveals that 
people often act impulsively, emotionally, or merely by force of habit. Think 
how agonizing decisions about jobs, spouses, and children can be. Were people 
the informed and calculating agents that rational choice theorists assume them 
to be, such decisions would not be particularly wrenching. Since these choices 
often take an emotional toll, it is easy to conclude that the theory is implausible. 

This conclusion, however, rests on a common misconception about the nature 
of rational choice. The theory does not aim to explain what a rational person will 
do in a particular situation. That question lies firmly in the domain of decision 
theory. Genuine rational choice theories, by contrast, are concerned exclusively 
with social rather than individual outcomes. Given that each individual acts 
rationally, will the aggregate outcome therefore be “rational” or desirable? Not 
necessarily. Regarded as stable equilibria, in which agents have no incentive 
to deviate from their course of action, given others’ behavior, social outcomes 
can be both unintended and undesirable. The overgrazing of the commons is a 
classic example of this dark side of Adam Smith’s invisible hand. 
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Figure 1 The multilevel structure of rational choice explanations (after Coleman 1990, p. 8). 





Unlike decision theory, rational choice theory is inherently a multilevel en- 
terprise (Figure 1). At the lower level, its models contain assumptions about 
individual cognitive capacities and values, among other things. Relation (2), 
for example, describes how a person who is subject to a given social structure 
at T, will behave at T, on the basis of these assumptions. Whereas relation (2) 
is necessary, it is far from the whole story. At the higher level, rational choice 
models also contain specifications of social structures. These social structures 
serve both as the social and material context (X) for individual action, and as 
new structures (Y) resulting from the actions of individuals whose behavior 
is described by the lower level assumptions (Coleman 1990, pp. 1—23). Since 
norms and other kinds of institutions enter the models both as contexts for and as 
outcomes of action, rational choice theories do not rest on premises pertaining 
exclusively to individuals. 

À second criticism of rational choice focuses on its motivational assumptions. 
Rational choice theorists regard both individual values and structural elements 
as equally important determinants of outcomes, but for methodological reasons 
their empirical applications typically place greater emphasis on social structural 
determinants. In consequence, rational choice explanations often are consis- 
tent with those of other general perspectives, such as structuralism and network 
analysis, that are usually regarded as lying well within the sociological main- 
stream (Goldthorpe 1996). One source of this methodological predilection lies 
in concerns about measurement. Values and other internal states are far more 
difficult to measure than structural constraints, which are external to individuals 
(Hechter 1992). Measuring values from the verbal responses to direct survey 
questions is problematic (Fischhoff 1991), Some progress in measuring val- 
ues is being made, however. For example, matching models (Logan 1996a) 
offer one means of measuring individual values indirectly without relying on 
peoples' responses to questions about their own internal states. 

The treatment of values in rational choice theory is due not only to method- 
ological considerations; it is also a reacticn to the complexity that is inherent in 
the multilevel nature of the theory. To reduce this complexity to more manage- 
able limits, rational choice theorists assume some model of individual action, 
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often one based on subjective-expected utility theory. They disagree about the 
most appropriate model, however. And so rational choice 1s more a rubric or a 
family of theories than a single all-encompassing one. 

Perhaps the most important division separates "thin" and "thick" models 
of individual action (Ferejohn 1991; for a fuller discussion of the differences 
between rational choice theories, see Goldthorpe's unpublished paper "Rational 
Action Theory for Sociology"). Thin rational choice models are unconcerned 
with the particular values (or goals) which individuals pursue. These models 
are based on a small number of strong assumptions: for example, that whatever 
an individual's values may be, they must be stable and transitive (if someone 
prefers a to b, and b to c, they must prefer a to c). Rational choice theories 
based on thin models—such as those usually found in economics and social 
choice theory—are highly universalistic and to that extent resemble theories in 
physics and biology concerning the optimal behavior of atoms and organisms. 

Thick models of individual action, advocated long ago by Max Weber, are 
substantively richer, for they countenance some aspects of intentionality. Since 
people have reasons for what they do, their behavior is predictable only if we 
know what motivates them. Thick models therefore specify the individual's 
existing values and beliefs. There are several means of doing so, but the most 
popular strategy has been to assume that individuals seek maximum quantities 
of exchangeable private goods such as wealth and, arguably, power or prestige. 
Wealth is commonly valued because it can be exchanged for a multitude of other 
goods in the marketplace. Thick models allow that individuals also value nonex- 
changeable goods—that some people live for the music of Mozart, and others 
for the thrill of horse racing. Indeed, the models assume that for any given indi- 
vidual, idiosyncratic values of this sort can outweigh the common one. Hence, 
without knowing each person's unique value hierarchy, individual behavior is 
unpredictable. As the size of groups increases, however, these idiosyncratic 
values tend to cancel each other out. In many circumstances the remaining 
common value permits quite accurate behavioral predictions at the collective 
level (Hechter 1994). Some rational choice theories specify other common val- 
ues that individuals pursue, such as uncertainty reduction (Friedman, Hechter & 
Kanazawa 1994), local status (Frank 1985), and distributive justice (Jasso 1990, 
1993). Other rational choice theories are beginning to model the processes that 
might be responsible for the formation of these common values (Becker 1996, 
Chai forthcoming). 

Contrary to some perceptions (England & Kilbourne 1990, pp. 160-61), 
thick rational choice theories do not necessarily assume that individuals are 
selfish agents. These theories can postulate any individual values at all, not 
excluding complete altruism. What is required 1s merely that individuals are 
self-interested, not selfish (Friedman & Diem 1990). 
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Thin models are substantively empty. They can be made consistent after 
the event, therefore, with nearly any kind of behavior. Thick ones—such as 
those postulating wealth-maximization—often are just plain wrong. To the 
degree that the idiosyncratic values are not distributed randomly in a population, 
explanations based on the pursuit of exchangeable private goods such as wealth, 
power, and prestige will fail. Since outcomes may be partially a function of 
individual motivations, predictions made on the basis of thick models can be 
mutually inconsistent. Such inconsistencies can only be resolved on the basis of 
empirical evidence. That decision theorists can routinely invalidate subjective 
expected utility theory is also troubling, even if they have yet to formulate 
an alternative superior to it. All told, the mechanisms of individual action in 
rational choice theory are descriptively problematic. Is a theory of higher- 
level outcomes invalidated by the inaccuracy of its lower-level mechanisms? 
Rational-choice theorists deny that it is (see Hechter 1996). 

Not all sociological reservations about rational choice derive from misun- 
derstanding, however. Above all, rational choice is a theory-driven enterprise: 
Many authors seem to value formal proofs of theories more than the confir- 
mation of these theories on the basis of rigorous empirical tests. Advocates 
often argue on its behalf, with not a little hauteur, by asserting that rational 
choice is the best available general theory in the social sciences. Since theories 
ultimately are judged by their capacity to account for empirical observations, 
the criticism (levelled by Green & Shapiro 1994, among others) that many 
rational choice theories are either false or remain untested cuts close to the 
bone. 

The appeal of rational choice in sociology is unlikely to increase substantially 
until the approach provides demonstrable empirical payoffs in a wide variety 
of substantive areas. In what follows, we review a sample of empirical applica- 
tions of rational choice in sociology published in English since 1988, when the 
last survey of the contributions of rational choice to macrosociology appeared 
(Friedman & Hechter 1988, pp. 204-11). Whereas most of these are explicitly 
derived from rational choice premises, we also discuss several theoretically un- 
motivated studies that report findings consistent with rational choice. Inclusion 
of this latter category of studies may be questioned. After all, if such studies 
can be carried out without reference to rational choice, then why is the theory 
necessary? One answer is that rational choice provides a heuristic framework 
that permits the diverse findings in all of these fields to be unified. Another is 
that it aids in making the logical links between different theories more explicit. 
Finally, readers should be aware that this essay is not a survey of sociological 
rational choice in its entirety, for it ignores the purely theoretical developments 
that continue to make up the bulk of research in this field (Coleman 1990 is the 
most important of these). 
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RECENT EMPIRICAL APPLICATIONS OF 
SOCIOLOGICAL RATIONAL CHOICE 


Many critics readily concede that rational choice may be appropriate for the 
analysis of voluntary exchanges in the Gesellschaft, which constitutes the tradi- 
tional terrain of economics. However, they contend that it can shed little or no 
light on social relations in the Gemeinschaft—those involving the family, reli- 
gion, and gender. For this reason, we begin our review with recent applications 
in these areas. 


The Family and Demography 

The family was once thought to fall outside the purview of rational choice 
(Sen 1983) because this primordial institution (Coleman 1993) engenders hot 
motivations—love and hate—rather than cold benefit/cost calculations. In a 
number of influential writings, Becker (1974, 1981) questioned the wisdom of 
dividing social relations into separate domains, each requiring its own distinc- 
tive analysis (Abell 1991). Following his lead, rational choice now is routinely 
applied to individuals’ family decisions. 

Two recent programmatic statements in this subarea resonate strongly with ra- 
tional choice. The concept of “family adaptive strategies” (Moen & Wethington 
1992) captures the notion of choice under constraints (with the family as the 
unit of decision-making) in life course studies in sociology. Likewise, Smith 
(1989) calls for a multilevel analysis of fertility. Many of the findings in the 
empirical literature are entirely consistent with rational choice theories. For 
example, Lloyd & South (1996) show that, net of individual characteristics, 
men’s transition to first marriage is strongly influenced by the structural con- 
straints they face, in the form of the availability of prospective partners in the 
local marriage market. South & Lloyd (1995) demonstrate that the aggregate 
divorce rates fluctuate with the structural opportunities available to spouses; 
the quantity and quality of potential new marital partners in the local marriage 
market significantly increase the risk of marital dissolution. This substanti- 
ates Becker's (1974, pp. 521—523) suspicion that men and women continue to 
search for mates even while they are married. Together, these studies 1ndicate 
that marriage and divorce are subject to the same external opportunities and 
constraints as exchanges in the market. 

Other research exemplifying how structural factors shape individual behavior 
and social outcomes (relations 1 and 2 in Figure 1) includes Brewster's (1994, 
Brewster et al 1993) study of community effects on adolescent sexual activities; 
Lee et al's (1994) analysis of the neighborhood effects on residential mobility; 
Hoem's (1991) demonstration of the impact of legal changes on marriage rates in 
Sweden; and Hoem’s (1993) findings concerning the effects of legal incentives 
on childbearing. 
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In contrast to marriage and divorce, standard microeconomic explanations of 
parenthood in the postindustrial age have not fared so well empirically. Prior 
to industrialization and the implementation and enforcement of child labor 
laws, having children made ample economic sense to parents. But why do 
individuals in the postindustrial age continue to have children when their net 
economic benefit for doing so is negative? Friedman et al (1994) propose a 
theory of uncertainty reduction to account for parenthood in advanced industrial 
societies. They argue that children in the postindustrial age serve as a means 
to reduce uncertainty for women and couples, and they predict that those who 
face greater uncertainty in their lives and/or lack alternative means to reduce 
uncertainty are more likely to become parents. This theory has been largely 
supported in its first empirical test (Wu 1996). 

Once people decide to have children, when do they stop? Yamaguchi & 
Ferguson (1995) explain rates of stopping and spacing childbearing by assuming 
that parents derive higher value from having children of both sexes than from 
having those of a single sex. Thus they predict that parents with two boys or 
two girls are more likely to have a third child than are parents of a boy and a 
girl, and the conclusions of their analysis of census data are consistent with this 
prediction. 

In addition to its contributions to the study of fertility, rational choice has also 
been applied to migration. Jasso & Rosenzweig (1990) study immigration to 
the United States as a matching process. On the one hand, potential immigrants 
must decide to emigrate from their countries of origin and choose a country 
of destination. On the other, host countries evaluate potential immigrants on 
a set of criteria and decide whom to welcome (Jasso 1988). Successful legal 
immigration takes place only when there is a match between the self-selective 
immigration decisions of foreign-born persons and the eligibility-conferring 
decisions of the host nations (Jasso & Rosenzweig 1990, p. 7). Immigration 
therefore is a process of matching between two sets of actors, each attempting 
to achieve goals and operating under constraints created by the other. 

Because migration should entail some loss of social capital (Coleman 1990), 
parental support ought to be crucial for the life prospects of children subse- 
quent to family migration. Hagan et al (1996) analyze the mobility histories of 
Toronto families and find support for Coleman's expectation that the negative 
effects of migration are more significant in families with uninvolved fathers 
and unsupportive mothers. (See Farkas 1996 for a comprehensive study of the 
effects of social and cultural capital on individuals' achievements.) 


Religion 

Religion is another subject once deemed unpromising for rational choice analy- 
sis. This has not prevented the development of a lively rational choice literature 
about it, however (Stark et al 1996; Warner 1993 is a comprehensive review). 
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The rational choice approach to religion draws a close analogy between reli- 
gion and the market economy (Stark & Bainbridge 1985, 1987, Young 1997). 
It conceives of a religious economy consisting of religious firms and religious 
consumers. Religious firms compete against one another to offer religious 
products and services to consumers, who choose between the firms. Stark & 
Bainbridge (1987) extend this analogy and offer a large number of propositions 
concerning the religious economy. 

They argue, for example, that the more pluralistic a religious economy is, 
the higher the level of religious mobilization. To the extent that there are many 
religious firms competing against each other, they will tend to specialize and 
cater to the particular needs of some segments of religious consumers. This 
specialization and catering in turn increase the number of religious consumers 
actively engaged in the religious economy. This proposition has been confirmed 
in a number of empirical studies. A positive correlation between pluralism 
and religious mobilization and commitment has been found in 17 Western na- 
tions (Iannaccone 1991), in 45 Catholic nations (finding an inverse relationship 
between Catholic monopoly and Catholic commitment) (Stark 1992), in 150 
largest US cities in 1906 (Finke & Stark 1988), ın 942 towns and cities in New 
York State from 1855 to 1865 (Finke et al 1996), and in 284 municipalities in 
Sweden (Hamberg & Pettersson 1994). Further, just as state regulation makes 
for inefficient business firms, state regulation of religion also makes for inef- 
ficient religious firms and dampens the mobilization of religious consumers 
(Chaves & Cann 1992). 

It is well known that strict churches are strong and growing in the contem- 
porary United States, whereas liberal ones are declining (Kelley 1972). For 
Iannaccone (1992, 1994) religious experience is a jointly produced collective 
good. Thus members of a church face a collective action problem. Strict 
churches, which often impose costly and esoteric requirements on their mem- 
bers, are able to solve this problem by weeding out potential free riders, since 
only the very committed would join the church in the face of such requirements. 
Finke & Stark (1992) use this theory to account for the winners and losers in 
the history of the religious economy of the United States. Consistent with the 
notion that religious experience is a collective good, Iannaccone et al (1995) 
show that churches that extract more resources from their members (in the form 
of time and money) tend to grow in membership. 


Gender 


Although critics (Ferber & Nelson 1993, Risman & Ferree 1995) have claimed 
that rational choice is oblivious to gender distinctions, and therefore is un- 
able to account for female behavior, here, too, there is disconfirming evidence 
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(Friedman & Diem 1990, 1993). Thus Brinton (1993) explains persistent gen- 
der inequalities in Japan partly as a function of parents’ (mostly mothers’) 
calculated decisions to invest more in their sons’ education than in their daugh- 
ters’. Due to the absence of a reliable social security system and the continued 
importance of a patrilineal family system, Japanese parents are financially de- 
pendent on their sons in their old age. This gives parents an interest in raising 
financially secure sons. By contrast, daughters will be incorporated into other 
families. Brinton (1989) also explains the disparity in educational attainment 
between Japanese men and women in rational choice terms. Japanese employ- 
ers tend not to hire university-educated women because they have very few 
years left to work for the company before they invariably retire to get married 
at around age 25 (Brinton 1992). University-educated women thus represent 
poorer investment prospects for the company than women with high school or 
junior college education who leave school at an earlier age and also can be hired 
at lower wage rates. Given these constraints imposed by the employers, most 
Japanese women choose not to pursue a university education. In a study that 
addresses occupational specialization by gender, Cowen (1996), an economist, 
seeks to explain why so few women have become famous painters in history. 
His evidence shows women were discouraged from pursuing careers in the 
visual arts by the constraints of marriage and childbearing. 

Supporting Hechter (1987), and later Friedman et al (1994, Hypothesis S-1), 
Treas (1993) demonstrates that a “collectivized” financial arrangement among 
married couples (joint bank accounts without separate ones for husbands and 
wives), which increases the financial dependence of the spouses on their mar- 
riage, is strongly associated with more stable unions. The general conclusion 
of Treas's study of married couples' financial organization is that principles of 
transaction cost economics accurately describe the choices of both husbands 
and wives. Treas & Giesen's (1996) analysis of a national probability sample 
from the National Health and Social Life Survey shows that, while men are 
more likely to have extramarital affairs than women, identical considerations 
of opportunities, costs, and benefits underlie both men's and women's decision 
to engage in illicit sex. 

Mackie (1996) offers an intriguing explanation for the varying fates of two in- 
stitutions affecting women in less developed societies. Whereas female genital 
mutilation persists ın Africa despite modernization, public education, and legal 
prohibition, female footbinding lasted for a thousand years in China but ended 
in a single generation. Mackie argues that each institution is a self-enforcing 
convention maintained by interdependent expectations on the marriage market. 
Footbinding in China was overturned by the establishment of associations of 
parents who pledged not to footbind their daughters nor let their sons marry 
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footbound women. Such pledge associations have not been established in 
Africa, however, and female genital mutilation persists there. 


Organizations 

At least since Coase (1937), rational choice analyses of organizations have 
proliferated. Since most of the research on capitalist firms is conducted by 
economists, it is not reviewed here. However, some sociologists have also made 
contributions. Thus Petersen (1992) analyzes the earnings of 63,000 salesper- 
sons in 178 department stores and finds that those who work under output- 
related payment systems (and receive either straight commission or salary plus 
commission) earn considerably more than those who work under non-output- 
related payment systems (and receive straight salary alone). Workers in the 
former system earn more because they are individually rewarded for working 
harder, while those under the latter system do not because they have no incen- 
tive to work harder. (For another analysis of the capitalist firm, see Raub & 
Keren 1993.) 

Hedstróm (1994) combines rational choice with network analysis to explain 
the spatial diffusion of labor unions in Sweden from 1890 to 1940. Individuals' 
decisions to form a labor union are influenced not only by their expectations 
of what others will do, but also by their knowledge of what others in their 
immediate network have done in the past. As anticipated, both spatial properties 
of the network and its density turn out to be important determinants of the 
speed and success of labor union formation. Brüder et al (1993) explain 
career mobility in a large West German company in terms of the organizational 
structure and hierarchies, and Greve (1994) similarly explains job mobility in 
Norway in terms of organizational diversity. 

Haveman (1995, Haveman & Cohen 1994) uses the characteristics of the 
industry and organizational ecology at the meso level to account for individual 
careers and job turnover at the micro level, which then aggregate to tenure dis- 
tributions at the organizational level. Her work therefore exemplifies the multi- 
level causal structure (albeit meso-micro in this case) typical of rational choice 
theory. Abell (1988, 1990) studies industrial producer cooperatives in Tanzania, 
Sri Lanka, and Fiji and concludes that the key to their success lies in support 
organizations. While agricultural and commercial sectors have distinct capital 
advantages over the industrial sector in these developing nations, these estab- 
lished sectors have no compelling incentive to promote cooperatives. Abell & 
Mahoney (1988) examine similar cooperatives in India, Peru, and Senegal and 
note that one of the determinants of their success in these countries is the solidar- 
ity, whether exogenously or endogenously produced, of cooperative members. 

Other rational choice sociologists focus on noncapitalist firms and on uncon- 
ventional organizations like urban and criminal gangs. Walder's (1986) analysis 
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of Chinese factories focuses on the attainment of compliance in a communist 
system. Although workers in capitalist firms usually are motivated both by 
firm-internal sanctions and by pressure from the labor market, the latter source 
of incentives was largely unavailable in China at the time of Walder’s study. 
Instead, motivation was provided by much stronger dependence and monitoring 
mechanisms than those typically found in capitalist firms. Workers in China 
depended on factory leaders for the satisfaction of a broad range of social bene- 
fits that could not be satisfied elsewhere. Dependence was further increased by 
workers’ inability to switch jobs. (Hechter & Kanazawa 1993 show that similar 
mechanisms are routinely employed in capitalist firms in Japan.) Nee (1992) 
accounts for the relative success of private firms over their state and collec- 
tive counterparts in the transitional economy of China in terms of the former’s 
transaction cost advantage over the latter. In contrast, Walder (1995) attributes 
China’s rapid industrial growth under its transitional economy to the higher 
efficiency of public firms, especially those run by local governments, which 
face different financial incentives and constraints than the central government. 

Two notable qualitative contributions to the literature on organizations are 
Gambetta's (1993) analysis of the Sicilian Mafia, and Jankowski's (1991) ethno- 
graphic study of urban street gangs in three American cities. For Gambetta, 
the Mafia is a firm offering private protection in a society where state pro- 
tection often is unreliable. (Jankowski’s gangs also offer protection for the 
residents of low-income neighborhoods who are not well-served by police.) 
Because the Mafia lacks legitimacy and cannot guarantee its products, it must 
first convince potential customers of their quality. Gambetta interprets seem- 
ingly bizarre aspects of the Mafia—its arcane rituals, obscure symbols, and 
blood ceremonies—as attempts to make credible commitments to customers 
and rivals alike. 


Crime and Deviance 


Deterrence theory has long been identified as an application of rational choice 
(Gibbs 1975). With Cornish & Clarke (1986), rational choice theories of crime 
emerged to explain criminal behavior as a function of expected reward and 
punishment, weighted by the subjective probability of detection (Piliavin et al 
1986). Jacobs’ (1996) interviews with 40 active crack dealers show that their 
behavior and their interpersonal strategies are jointly determined by their desire 
to maximize profit from their sales and to minimize the possibility of arrest by 
undercover narcotics officers. The dealers, who mostly interact with anony- 
mous clientele, employ a perceptual shorthand on unfamiliar clients; these 
are observational and testing techniques to detect deception on the part of the 
clients. Horney & Marshall’s (1992) analysis of data collected from incarcer- 
ated adult offenders of major felonies confirms that the criminals’ subjective 
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perception of the risk of detection is realistically updated by their own experi- 
ence in the particular crime; those who commit the crime often without getting 
caught lower their expectation of detection while those who are caught most 
of the time subsequently increase their perception of risk. Recently, rational 
choice perspectives on crime have converged with the routine activity approach 
(Clarke & Felson 1993). Both distinguish between criminality (the decision 
to become involved in a particular form of crime) and crime (an actual crimi- 
nal event), and explain the latter in terms of opportunities—the simultaneous 
presence of criminal target and absence of risk of detection. 

A distinctive thrust of rational choice theories of crime is their conception 
of criminality as a function of higher expected material returns from crimi- 
nal activities relative to those from conventional pursuits. Williams (1989), 
Jankowski (1991), and MacLeod (1995) all provide ethnographic accounts to 
show that teenagers in low-income neighborhoods make deliberate decisions 
to pursue criminal activities because few legitimate alternative means to make 
as much money are available to them. Jankowski (1991, pp. 180—193) notes 
that, in some chronically poor neighborhoods, gang membership has become a 
family tradition, and many parents actively encourage their children to belong 
to the same gangs to which they themselves belonged when they were younger, 
much as parents with Ivy League educations encourage their children to attend 
their alma maters. Pezzin (1995) argues that economic incentives and opportu- 
nity costs exert a powerful influence on criminal career duration and desistance 
choice. Her analysis of the National Longitudinal Survey of Youth shows that 
future expected criminal earnings have a strong negative effect on the decision 
to terminate a criminal career among youths between the ages of 16 and 22. 
Since the effect of the availability of legal income-generating activities is just as 
strong as that of perceived cost of punishment, Pezzin argues that the carrot may 
work just as efficiently as the stick in shortening the length of criminal careers. 

Pampel & Gartner (1995) provide a multilevel explanation for variations in 
homicide rates in 18 advanced industrial nations. Other researchers stress the 
need to include institutional factors in the explanations of crime. Hechter & 
Kanazawa (1993) use Hechter’s (1987) theory of group solidarity to explain the 
low crime rates in contemporary Japan. They argue that the higher dependence 
of the Japanese on their groups and their higher visibility within the groups in- 
crease their conformity to social norms. They reject cultural explanations of low 
crime rates in Japan by showing that China and South Korea, which share many 
cultural characteristics with Japan, have higher crime rates, and that the same 
mechanisms of dependence and visibility work to induce conformity among 
American auto workers in Japanese transplants factories. Petee et al’s (1994) 
finding that the deterrent effects of informal sanctions are the strongest among 
communities characterized by higher levels of integration is also consistent 
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with Hechter's (1987) theory, which predicts that compliance to norms is a 
multiplicative function of the dependence and the efficiency of social control. 

Finally, one of the most influential modern theories of crime and deviance, 
Hirschi's (1969) control theory, has a strong rational choice flavor (see Hirschi 
1986, 1994). This theory explains juveniles’ propensity to engage in delin- 
quency in terms of preferences ( attachment," "belief ”), opportunity cost (com- 
mitment”), and resource scarcity ("involvement"). Nagin & Paternoster (1994), 
among others, provide empirical support for control theory. 


Comparative-Historical Sociology 


Many have claimed that the scope of rational choice theory is limited to the 
capitalist era, for previous social formations were much less individualistic. 
However, rational choice historical sociologists have accounted for variation in 
state autonomy and policy implementaticn in absolutist societies on the basis 
of dependence and control mechanisms. Thus Kiser (1986-1987) suggests 
that the most autonomous early modern monarchs had independent resources 
and faced the weakest monitoring (legislative and judicial) institutions. Kiser 
et al (1995) show that in the case of war—a policy that rulers tended to favor 
more than their subjects—high ruler autonomy resulted in more frequent war 
initiation in early modern Western Europe. > 

Even the most autonomous rulers must delegate authority to agents if they are 
to implement policies, however. Hence, agency theory ought to cast light on the 
causes and consequences of tax administration. For instance, when the ruler's 
monitoring capacity is poor, tax farming should be a more efficient means of 
providing state revenue than bureaucratic administration because it gives agents 
stronger incentives to collect taxes and surrender them to rulers (Kiser 1994). 
Supportive evidence is found both in early modern Prussia (Kiser & Schneider 
1994) and Imperial China (Kiser & Tong 1992). In a similar vein, Adams 
(1996) explains the failure of the Dutch East Indies Company, and contrasting 
success of the English East Indies Company, in terms of the problems that the 
European owners faced in controlling their agents in Asia. The Dutch company 
collapsed when colonial servants saw better alternative opportunities offered 
by the English company, and the Dutch owners were not able from afar to stop 
their servants' defection. 

Explaining major institutional shifts is one of the great challenges of his- 
torical sociology. Both of the popular perspectives in historical sociology— 
structuralism and cultural analysis—would appear to face insuperable difficul- 
ties in accounting for institutional change endogenously. One such institutional 
change is the subject of Friedman's (1995) recent study. Until the late nineteenth 
century, in all Western societies the custody of children after their parents’ di- 
vorce or separation was presumptively and routinely awarded to their fathers. 
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Then, between 1880 and 1920, in all of these countries presumptive custody 
shifted from the father to the mother. Due to rising divorce rates, these states 
faced the prospect of having to support a large number of divorced women on 
welfare. In order to avoid this expenditure, Friedman argues, they transferred 
the custody of children from fathers to mothers, making fathers responsible for 
the financial welfare of their children. Divorced fathers’ child support payments 
also secondarily contributed to the economic welfare of their former spouses, 
the custodial mothers. By this means, all Western countries avoided supporting 
divorced women and took responsibility only for the education of children. 


Political Sociology 


Although collective action is an abiding concern of rational choice research on 
politics, much of the effort has been theoretical and therefore falls outside of 
the purview of this review (for a comprehensive but acerbic review, see Udéhn 
1995). There are notable exceptions, however. In an impressive study, Brustein 
(1996) seeks to explain the rise of the Nazi party in interwar Germany. Hitler’s 
rise is often ascribed to his party’s manipulation of emotional voters susceptible 
to ressentiment, but Brustein argues that the popularity of the National Socialists 
was due to their strategic adoption of policies that appealed to the pocketbook 
interests of a large and socially heterogeneous part of the electorate. One of 
his intriguing conclusions is that Hitler had to conceal his antisemitism in order 
to gain popularity among blue-collar workers. Brustein’s theory is strongly 
supported by his analysis of data on more than 42,000 Nazi members who 
joined the party between 1925 and 1933. 

Survey data collected and analyzed by Opp et al (1995) reveal how the macro 
outcome of the successful protest movement in East Germany in 1989 was an 
agpregate result of the values, incentives, and network ties of Germans at the 
micro level. These individuals’ decisions to participate, in turn, were influenced 
by macrolevel political conditions. In contrast, Shlapentokh (1995) accounts 
for the absence of collective protests in Russia in the face of deteriorating social 
and economic conditions in 1992-1994 in terms of individuals’ expectations 
that such collective action would not be successful. 


Stratification and Mobility 


Human capital theory long has been the principal rational choice approach 
to stratification. A vast sociological literature on labor markets compares its 
merits to those of status attainment and structural perspectives. However, a new 
rational choice approach to stratification has been developed. Logan (1996b) 
employs a two-sided logit methodology to mode] employment outcomes (the 
matching of workers to jobs) as a function of explicitly rational choices made by 
workers and employers under structural constraints. The jobs that employers 


SOCIOLOGICAL RATIONAL CHOICE 205 


supply function as the constraints within which workers make their choices, 
given their preferences; the supply of workers functions as the constraint within 
which employers make their choices, given their preferences. Further, the 
preferences of the employers for certain types of workers create the constraints 
for the workers, and the preferences of the workers for certain types of jobs 
create the constraints for the employers. Logan’s analysis of General Social 
Survey data showcases the utility of this new method. 

Until 1967, secondary schools in Ireland charged their students tuition fees, 
and these costs were thought to be barriers to educational attainment for chil- 
dren from working-class families. However, when these tuition fees were 
eliminated by the educational reform of 1967, access to secondary education 
did not become more egalitarian. Raftery & Hout (1993) argue that, due to the 
high employment rate and high wages in the Irish economy in the late 1960s, 
the opportunity costs of continuing education (in the form of forgone wages) 
were high, relative to the cost of tuition fees, now eliminated. Thus, it did 
not make much economic sense for Irish children of working-class families 
to continue formal education and postpone full-üme employment, even in the 
absence of tuition fees, and the educational reform failed to have the intended 
effect. Breen & Whelan’s (1993) analysis concurs with Raftery & Hout (1993). 

Walder (1992) provides a macro-meso explanation of stratification among 
organizations in a socialist economy, and he tests it with survey data gathered in a 
large Chinese industrial city. Budgetary resources of the local government and 
its dependence on a work organization jointly determine how much revenue 
the local government extracts from the organization. The level of revenue 
extraction by the government determines (at the meso level) the organization's 
abilities to provide benefits to its employees, which then aggregates (at the 
macro level) to inequalities among organizations. Waider's (1992) explanation 
of organizational stratification in China is therefore another example of the 
multilevel causal structure of rational choice theory. 


Race and Ethnic Relations 


Most of the rational choice research in this area has been theoretical, with 
relatively few empirical applications. Clark (1991), however, finds empirical 
support for Schelling's (1971) theory that slight differences in residents' pref- 
erence for the racial composition of their neighborhood quickly lead to extreme 
segregation. Nee et al (1994) contend that the structure of ethnic and ethnically 
mixed economies in Los Angeles is a function of the economic opportunities 
and constraints that Asian immigrants face as well as of their skills and quali- 
fications. While most Asian immigrants prefer to work outside of their ethnic 
enclaves to take advantage of higher pay and better working conditions, many 
are unable to do so because of their lack of occupational and language skills. 
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One means of upward mobility available to immigrants of little occupational 
and language skills is entrepreneurship. Sanders & Nee (1996) demonstrate that 
the self-employment status among Asian and Hispanic immigrants is strongly 
facilitated by human capital (foreign-earned formal education) and financial 
and social capital available from their family members. Those with spouses 
and other family members who can contribute to their business are more likely 
to succeed as entrepreneurs than are single immigrants. 

Split labor market theory can explain how ethnic and racial antagonism arises, 
but it cannot account for interracial solidarity. Brown & Boswell (1995) use 
Heckathorn’s (1988, 1989, 1990) theory of group-mediated social control to 
address the possibility of interracial solidarity or strikebreaking under split labor 
market conditions. Their qualitative comparative analysis of the Great Steel 
Strike of 1919 largely supports their predictions. 

Chat’s (1996) theory of ethnogenesis seeks to explain the location of ethnic 
boundaries in developing societies. The theory proposes a link between coop- 
eration and altruism in small, rural communities. Then it predicts how these 
altruistic preferences, in conjunction with structural factors and rational behav- 
ior, generate boundaries for larger-scale ethnic collective action. The theory is 
applied to the creation of new ethnic groups among the Pan-Igbo in Nigeria, the 
Luba-Kasai in Zaire, the Pan-Malays in Malayasia, and the Muhajir in Pakistan. 

Group process researchers mostly concentrate on the dynamics of dyads 
and small groups. Research in this tradition began with Emerson’s (1962) 
seminal analysis of exchange relations in dyads, which was later extended to 
exchange networks (Cook & Emerson 1978). Currently at least four theories 
compete in the field of network exchange: Bienenstock & Bonacich’s core the- 
ory (1992, 1993; Bonacich & Bienenstock 1993, 1995); Cook & Yamagishi’s 
power-dependence theory (1992; Yamagishi & Cook 1993); Friedkin’s ex- 
pected value theory (1986, 1992, 1993); and Willer et al’s exchange resistance 
theory (Markovsky et al 1988, Markovsky et al 1993, Lovaglia et al 1995, 
Skvoretz & Lovaglia 1995). Since this field relies heavily on the experimental 
testing of key hypotheses (Skvoretz & Willer 1993), it resembles what Collins 
(1994) calls “high-consensus, rapid-discovery” science. 

Currently there is a debate about the determinants of dyadic cohesion among 
self-interested rational actors. On the one hand, Kollock (1994) argues that 
when the outcome of an exchange is uncertain, individuals make commit- 
ments to particular exchange partners in order to reduce uncertainty (see also 
Yamagishi & Yamagishi 1994). On the other, Lawler & Yoon's (1993, 1996) 
theory of relational cohesion maintains that individuals experience positive 
affect from successfully negotiated and completed exchanges. Then self- 
interested actors develop affective commitment to particular exchange partners, 
even in the face of more attractive offers, in order to consume this positive affect 
from continued exchange with the same partner. 
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Sometimes research on group processes has clear macrolevel implications. 
Orbell & Dawes’ (1991, 1993) "cognitive miser" theory explains social welfare, 
order, and efficiency at the macro level in terms of dyadic interactions at the 
micro level. Their theory argues that intending cooperators are more likely 
to play dyadic Prisoner's Dilemma games than intending defectors, when they 
have the option not to play. Then the outcomes of most completed games are 
mutual cooperation, whereby the players enjoy the fruit of cooperation, which 
aggregates to order and economic surplus at the macro level. Their laboratory 
experiment (Orbell & Dawes 1993) confirms the theory's crucial hypothesis 
that intending cooperators are more likely to choose to play than intending 
defectors. 

The problem of cooperation and public goods provision is usually conceived 
of as an n-person Prisoner's Dilemma game. In contrast, Diekmann's (1985, 
1986, 1993) “volunteer’s dilemma" game captures the problem of cooperation 
in situations where there is a potential diffusion of responsibility (Darley & 
Latané 1968, Latané & Darley 1970). The public good in the volunteer's 
dilemma is produced by a single volunteer (with the effect of additional volun- 
teers completely superfluous), and there is no pure dominant strategy; players 
are better off defecting if there is at least one cooperator, but better off cooperat- 
ing if there is none. The volunteer's dilemma points to the utility of sociological 
rational choice theory in that the use of a strictly game-theoretic solution—a 
mixed Nash equilibrium strategy—leads to highly inefficient outcomes. An 
optimal level of cooperation requires communication and coordination among 
actors, which is fostered by social norms (Diekmann 1985). The experimental 
evidence shows that subjects do not employ strictly game-theoretic solutions 
(Diekmann 1986, 1993). 


Medical Sociology 


Whereas medicine has been an important site for the application of behavioral 
decision theory, rational choice ts in its infancy in medical sociology. Thoits 
(1994), however, calls for more emphasis on human agency, and on the delib- 
erate choices that individuels make, in the study of stress and depression. Her 
analysis of panel data shows that individuals are often able to solve potential 
problems in their lives (and thus are not affected by them), and that only those 
they fail to solve lead to psychological symptoms. Although Pescosolido's 
(1992) "social organization strategy" framework ostensibly is critical of ratio- 
nal choice, it is entirely consistent with the version of rational choice presented 
in the present essay. Her analysis of individuals seeking medical help specif- 
ically incorporates network structures and uses network ties and information 
gathered from past interactions as resources and social capital. (Knoke 1990 
and Burt 1992 offer other examples of how network analysis can be integrated 
with rational choice.) 
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Broadhead & Heckathorn (1994, Broadhead et al 1995) conceive of AIDS 
prevention as a collective action problem. They use agency theory and 
Heckathorn’s theory of group-mediated social control to argue that peer-driven 
intervention programs are more effective in combating AIDS. Extensive ethno- 
graphic data support their contention. 


CONCLUSION 


Sociological rational choice is an inherently multilevel enterprise. It seeks to 
account for social outcomes on the basis of both social context and individual 
action. In this respect it often differs, at least in emphasis, from other (thin) 
versions of rational choice theory that are employed in much economic analysis 
and game theory. Sociological rational choice is beginning to make empirical 
contributions to a broad range of substantive topics in the discipline. While 
applications of rational choice in subfields like politics, labor markets, formal 
organizations, and criminology by now are traditional, the approach has also 
begun to make empirical advances in areas formerly regarded as inhospitable, 
such as the family, gender, and religion. At the same time, a growing number 
of theoretically uncommitted empirical researchers are testing rational choice 
theories—sometimes unwittingly—in their own domains. These studies will 
reveal much about the scope and limits of rational choice theory. Invariably, 
their findings will strengthen certain versions of rational choice theory at the 
expense of others. 

Beyond the crucial task of subjecting existing rational choice theories to 
empirical tests, we discern three future directions for developing sociological 
rational choice theory. To better explain social outcomes arising from individual 
action, rational choice theorists must begin to understand the origin and the 
nature of values that motivate human behavior (Hechter et al 1993). Values— 
and the preferences derived from them—are one of two major categories of 
determinants of individual behavior in rational choice theory, the other being 
institutional constraints. Rational choice has been mute on the origin and 
nature of individual values. This silence has been justified under the dictum 
de gustibus non est disputandum. Due to the difficulty of arriving at valid and 
reliable measures of internal states (Hechter 1992), rational choice theorists 
usually impute values to actors by assumption. The typical value assumption 
has been wealth maximization because wealth is highly fungible. Thus actors 
who may hold various idiosyncratic values can be expected to pursue wealth 
as a means to achieve their divergent goals (Hechter 1994). Yet some research 
also has advanced atypical value assumptions, including uncertainty reduction 
(Friedman et al 1994), local status (Frank 1985), distributive justice (Jasso 
1990, 1993), and regret-dissonance reduction (Chai, forthcoming). 
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Even if we know what values and preferences actors hold, their decision- 
making mechanisms are contested. Hence the study of decision algorithms is 
a second promising field of inquiry. Subjective expected utility maximization 
theory, the foundation of most macroscopic rational choice theories, posits that 
actors are forward-looking maximizers. Based on the available information and 
their best estimates of what the future holds, actors assign subjective probabil- 
ities to various future states of the world and make their decisions according 
to these subjective probabilities. However, critics have pointed out that this 
forward-looking process of decision-making is cognitively too demanding for 
most human actors (Petersen 1994). Others have proposed alternative deci- 
sion algorithms. Macy (1993) argues that individuals are backward-looking 
adaptive learners who adjust their decisions on the basis of the past outcomes 
associated with their choices, and he provides experimental evidence (Macy 
1995) supporting his contention. In contrast, Heckathorn (1996) argues that 
individuals sometimes are sideways-looking cultural imitators whose decisions 
emulate those made by their neighbors who are doing well. In his view, whether 
actors are forward-, backward-, or sideways-looking depends on the nature of 
available information about the future. If genuine and accurate information is 
available about the future, then actors will assess the subjective probabilities 
of various states of the world as predicted by the subjective expected utility 
maximization model. If the past is the best predictor of the future, then actors 
will be backward-looking. If the future is best known by observing who is 
doing well in the present, then actors will be sideways-looking imitators. 

The ultimate challenge for sociological rational choice theory is explaining 
the emergence of institutions. Unlike values and preferences, institutions are 
intersubjective and therefore more easily measurable. So far this has not led 
to concerted research on institutional emergence, however. Still, a handful 
of forays have been made. Contributions to Hechter et al (1990) begin to 
tackle the emergence of a small handful of institutions, and Coleman (1990) 
provides one framework for understanding how norms emerge among actors 
with specific interests and resources. Logan’s (1996a,b) two-sided logit model 
offers a promising new way to trace the origins of institutional constraints for 
actors to their exchange partners’ preferences. And Chai & Hechter (1997) 
propose an elementary model for the emergence of the state and of social order. 
However, an endogenous theory of institutional emergence remains a distant 
goal at this time. 


ACKNOWLEDGMENTS 


Thanks to Mary Brinton, Sun-Ki Chai, Karen Cook, and Christine Horne for 
comments on an earlier draft of this review. 


210 HECHTER & KANAZAWA 


Visit the Annual Reviews home page at 
http://www.annurev.org. 


Literature Cited 


Abell P. 1988. Establishing Support Systems for 
Industrial Co-Operanves: Case Studies from 
the Third World. Aldershot, Engl: Avebury 

Abell P. 1990 Supporting industrial coopera- 
tives 1n developing countries. some Tanza- 
nian experiences Econ Indust. Democracy 
11 483—504 

Abell P. 1991, Homo sociologicus: Do we need 
him/her? Sociol. Theory 9:195—98 

Abell P, Mahoney N. 1988. Small-Scale Indus- 
trial Producer Co-Operatives in Developing 
Countries Delhi: Oxford Univ Press 

Adams J. 1996 Principals and agents, colonial- 
ists and company men: the decay of colonial 
control in the Dutch East Indies Am Sociol. 
Rev. 61,1228 

Baron JN, Hannan MT 1994. The impact of 
economics on contemporary sociology J 
Econ. Lit. 22 1111-46 

Becker GS. 1974 A theory of marriage: Part II 
J Polit. Econ 82 811-826 

Becker GS 1976 The Economic Approach to 
Human Behavior Chicago: Univ. Chicago 


Press 

Becker GS. 1981. A Treatise on the Family 
Cambridge: Harvard Univ. Press 

Becker GS. 1996. Accounting for Tastes. Cam- 
bndge: Harvard Univ. Press 

Bienenstock EJ, Bonacich P 1992. The core as a 
solution to exclusionary networks Soc. Net- 
works 14:231—44 

Bienenstock EJ, Bonacich P. 1993 Game- 
theory models forexchange networks. exper- 
imental results. Sociol Perspect. 36:117-35 

Bonacich P, Bienenstock EJ. 1993 Assignment 
games, chromatic number, and exchange the- 
ory. J. Math Sociol 17 243-59 

Bonacich P, Bienenstock EJ 1995. When ratio- 
nality fails. unstable exchange networks with 
empty cores, Ration. Soc. 7:293—320 

Breen R, Whelan CT. 1993. From ascription to 
achievement?. origins, education and entry 
to the labour force in the Republic of Ire- 
land during the twentieth century. Acta So- 
ciol. 36:3-17 

Brewster KL 1994, Race differences in sexual 
activity among adolescent women: the role 
of neighborhood characteristics. Am. Sociol. 
Rev. 59:408—24 

Brewster KL, Billy JOG, Grady WR 1993. So- 
cial context and adolescent behavior: the im- 
pact of community on the transition to sexual 
activity Soc Forces 71:713—40 

Brinton MC. 1989 Gender stratification in con- 


temporary urban Japan. Am Sociol. Rev 
9—64 


54: 

Brinton MC 1992. Christmas cakes and wed- 
ding cakes. the social organization of 
Japanese women's hfe course. In Japanese 
Social Organization, ed TS Lebra, pp. 79— 
107. Honolulu Univ Hawai: Press 

Brinton MC. 1993 Women and the Economic 
Miracle: Gender and Work in Postwar Japan 
Berkeley: Univ Calif Press 

Broadhead RS, Heckathorm DD 1994. AIDS 
prevention outreach among injection drug 
users agency problems and new approaches 
Soc Probl. 41:473—95 

Broadhead RS, Heckathorn DD, Grund J-PC, 
Stern LS, Anthony DL. 1995. Drug users out- 
reach workers in combating AIDS: prelimi- 
nary results of a -driven intervention. J, 
Drug Issues 25:531—64 

Brown C, Boswell T. 1995, Strikebreaking or 
solidarity in the great steel strike of 1919 a 
split labor market, game-theoretic, and QCA 
analysis. Am J Sociol 100 1479-519 

Bruderl J, Preisendorfer P, Ziegler R. 1993 Up- 
würd mobihty in organizations: the effects 
of hierarchy and opportunity structure. Eur. 
Sociol, Rev. 9.173-88 

Brustein W. 1996 The Logic of Evil. The Social 
Origins of the Nazi Party, 1925 to 1933. New 
Haven Yale Univ. Press 

Burt RS 1992. Structural Holes: The Social 
Structure of Competition Cambridge, MA 
Harvard Univ Press 

Chai S-K. 1996. A theory of ethnic grou 
boundanes Nations Nationalism 2:281—30 

Chai S-K. 1997. Creating Identity from Rational 
Action: A General Theory of Preference and 
Belief Formation. Ann Arbor Univ Mich. 
Press. In press 

Cha: S-K, Hechter M. 1997, A theory ofthe state 
and of social order J Math. Soc. In press 

Chaves M, Cann DE. 1992, Regulation, plural- 
ism, and religious market structure: explain- 
ing religion’s vitality. Ration Soc. 4°272- 

0 


9 

Clark WAV 1991 Residential preferences and 
neighborhood racial segregation’ a test of 
the Schelling segregation model. Demogra- 
phy 28-1-19 

Clarke RV, Felson M. 1993 Routine Activity 
and Rational Choice. New Brunswick, NJ 
Transaction 

Coase RH 1937 The nature of the firm Eco- 
nomica N S 4 386—405 


SOCIOLOGICAL RATIONAL CHOICE 


Coleman JS. 1990 Foundations of Social The- 
ory Cambridge. Harvard Univ Press 

Coleman JS. 1993. The rational reconstruction 
of society. Am Sociol, Rev 58 1-15 

Collins R. 1994, Why the social sciences won’t 
become high-consensus, rapid-discovery sci- 
ence Sociol Forum 9 155—771 

Cook KS, Emerson RM. 1978 Power, equity 
and commitment in exchange networks Am 
Sociol Rev 43:721-39 

Cook KS, Yi ishi T 1992 Power in ex- 
change networks: a power-dependence for- 
mulation. Soc. Networks 14 245-66 

Comush DB, Clarke RV, eds. 1986 The Reason- 
ing Criminal: Rational Choice Perspectives 
on Offending New York: Spnnger-Verlag 

Cowen T 1996 Why women succeed, and fail, 
in the arts. J. Cult. Econ. 20°93-113 

Darley JM, Latané B. 1968. Bystander interven- 
tion m emergencies: diffusion of responsibil- 
ity J. Pers. Soc. Psychol 8 377-83 

Diekmann A 1985. Volunteer's dilemma J 
Confl. Resolut. 29-605—10 

Diekmann A. 1986. Volunteer’s dilemma’ a 
social trap without a dominant and 
some experimental results In Paradoxical 
Effects of Social Behavior: Essays in Honor 
of Anatol Rapoport, ed. A Diekmann, P Mit- 
ter. Heidelberg: Physica- Verlag 

Diekmann A. 1993. Cooperation in an asym- 
metric volunteer's dilemma game: theory 
and experimental evidence Int J Game The- 
ary 22:75-85 

Emerson RM 1962 Power-dependence rela- 
tions. Am, Sociol. Rev 27-31-41 

England P, Kilbourne BS. 1990. Feminist cri- 
ses of the separative model of self ım- 
plications for rational choice theory Ratton. 
Soc. 2 156-71 

Farkas G 1996. Human Capital or Cultural 
Capital? Ethnicity and Poverty Groups in an 
Urban School District. New York: Aldine 

Ferber MA, Nelson JA, eds. 1993 Beyond 
Economic Man: Feminist Theory and Eco- 
nomics Chicago. Univ. Chicago Press 

Ferejohn JA. 199] Rationality and interpreta- 
tion’ Parliamentary elections 1n early Stuart 
England. In The Economic Approach to Pol- 
incs: A Critical Reassessment of the Theory 
of Rational Action, ed KR Monroe, pp 279— 
305 New York: HarperCollins 

Finke R, Guest AM, Stark R 1996 Mobilizing 
Jocal religious markets: religious pluralism 
in the Empire State, 1855 to 1865 Am So- 
ciol. Rev. 61:203-18 

Finke R, Stark R. 1988 Religious economies 
and sacred canopies religions mobilization 
in American cities, 1906 Am Sociol Rev 
53:41-49 

Finke R, Stark R. 1992. The Churching of Amer- 
ica 1776-1990: Winners and Losers in Our 


211 


Religious Economy New Brunswick, NJ: 
Rutgers Univ. Press 

Fischhoff B. 1991 Value elicitation’ Is there 
anything 1n there? Am. Psychol 46°835—47 

Frank RH. 1985. Choosing the Right Pond: Hu- 
man Behavior and the Quest for Status. New 
York’ Oxford Univ Press 

Fnedkin N 1986 A formal theory of social 
power J Math. Sociol, 12:103—26 

Fnedkin N 1992. An expected value model of 
social power predictions for selected ex- 
change networks. Soc. Networks 14:213—30 

Friedkin N 1993. An expected value model 
of social exchange outcomes In Advances 
in Group Processes, ed EJ Lawler, B 
Markovsky, K Heumer, J O'Brien, 10:163- 
93 Greenwich, CT JAI 

Friedman D 1995. Towards a Structure of In- 
difference: The Social Origins of Maternal 
Custody New York: Aldine de Gruyter 

Friedman D, Diem C. 1990 Comments on Eng- 
land and Kilbourne. Ration. Soc. 2:517—21 

Friedman D, Diem C. 1993. Feminism and the 
pro- (rational-) choice movement: rational- 
Choice theory, feminist critiques, and gender 
inequality. In Theory on Gender/Feminism 
on Theory, ed. P England, pp 91-114. New 
York: Aldine de Gruyter 

Friedman D, HechterM 1988 The contribution 
of rational choice theory to macrosociologi- 
cal research Sociol Theory 6:201—18 

Fnedman D, Hechter M, Kanazawa S. 1994. A 
theory af the value of children Demography 
31:375-401 

Gambetta D 1993 The Sicilian Mafia: The 
Business of Private Protection, Cambridge 
Harvard Univ, Press 

Gibbs JP. 1975. Crime, Punishment and Deter- 
rence New York: Elsevier 

Gold JH. 1996. The quantitative analy- 
sis of large-scale data-sets and rational ac- 
tion theory. for a sociological alliance. Eur. 
Sociol. Rev. 12:109-26 

Green DP, Shapiro I. 1994. Pathologies of Ra- 
tonal Choice Theory: A Critique of Applica- 
tons in Political Science New Haven Yale 
Univ. Press 

Greve HR. 1994 Industry diversity effects on 
job mobility. Acta Sociol. 37:119—39 

Hagan J, MacMillan R, Wheaton B. 1996. New 
kid in town. social capital and the life course 
effects of family migration on children. Am. 
Sociol, Rev 61°368-85 

Hamberg EM, Pettersson T. 1994. The religious 
market denominational compettion and re- 
ligious participation 1n contemporary Swe- 
den J. Scr Study Relig. 33.205—16 

Haveman HA. 1995 The demographic meta- 
bolism of organizations industry dynamics, 
turnover, and tenure distributions Admin 
Sci Q 40:586-618 


212 HECHTER & KANAZAWA 


Haveman HA, Cohen L. 1994. The ecological 
dynamics of careers: the impact of organiza- 
tional founding, dissolution, and merger on 
Job mobility Am. J Sociol. 100:104—52 

Hechter M. 1987. Principles of Group Solidar- 
ity. Berkeley: Univ Calif Press 

Hechter M. 1992. Should values be written out 
of the social scientist’s lexicon? Sociol, The- 
ory 10 214-30 

Hechter M. 1994. The role of values in rational 
choice theory. Ration, Soc. 6:318—33 

Hechter M. 1996. Through thick and thin: How 
far can theory predict behaviour? Times Lut. 
Suppl March 29, p 15 

Hechter M, Kanazawa S. 1993. Group solidar- 
ity and social order in Japan. J. Theor. Polit. 
5:455—93 

Hechter M, Nadel L, Michod RE 1993 The 
Origin of Values. New York. Aldine de 
Gruyter 

Hechter M, Opp K-D, Wippler R 1990. Social 


Institutions: Their Emergence, Maintenance. 


and Effects. New York Aldine de Gruyter 

Heckathom DD 1988 Collective sanctions and 
the creation of prisoner’s dilemma norms 
Am J Sociol 94 535-62 

Heckathorn DD. 1989. Collective action and the 
second-order free-nder problem. Ration. Soc. 
1.78-100 

Heckathorn DD 1990 Collective sanctions and 
compliance norms: a formal theory of group- 
mediated social control. Am Sociol Rev. 
55 366-84 

Heckathorn DD 1996 The dynamics and 
dilemmas of collective action Am. Sociol 
Rev. 61.250—77 

Hedström P. 1994. Contagious collectivities. on 
the spatial diffusion of Swedish trade unions, 
1890-1940. Am. J Sociol 99 1157—79 

Hirschi T 1969 Causes of Delinquency Berke- 
ley. Univ. Calif. Press 

Hirsch T. 1986 On the compatibility of rational 
choice and social control theories of crime 
See Cornish & Clarke 1986, pp 105-18 

Hırschı T 1994 Preface to the J Edi- 
tion. Causes of Delinquency. Tokyo: Bunka 
Shobo 

Hoem JM. 1991. To marry, just 1n case...: the 
Swedish widow's-pension reform and the 
peak in marriages in December 1989, Acta 
Sociol 34°127-35 

Hoem JM. 1993. Public policy as the fuel of fer- 
tility. effects of a policy reform on the pace 
of childbearing in Sweden in the 1980s. Acta 
Sociol. 36:19-31 

Homey J, Marshall IH 1992 Risk perceptions 
among serious offenders: the role of crime 
and punishment. Criminology 30:575-—-94 

IannacconeLR 1991 The consequences of rel- 
gious market structure. Adam Smuth and the 
economics of religion. Ration Soc. 3:156—77 


Iannaccone LR. 1992. Sacrifice and stigma. re- 
ducing free-riding 1n cults, communes, and 
other collectivities. J. Polit. Econ. 100:271— 
91 

Iannaccone LR. 1994, Why strict churches are 
strong Am J. Sociol, 99:1180-211 

Iannaccone LR, Olson DVA, Stark R. 1995 
Religious resources and church growth. Soc. 
Forces 74 705-31 

Jacobs BA. 1996. Crack dealers and restrictive 
deterrence: identifying narcs. Criminology 
34.409—31 

Jankowski MS. 1991. Islands in the Street. 
Si T and American Urban Society. Berke- 
ley: Univ. Calif. Press 

Jasso G. 1988 Whom shall we welcome? elite 
judgments of the criteria for the selection of 
immi Am Sociol Rev 53:919-32 

Jasso G. 1990. Methods for the theoretical 
and empirical analysis of comparison pro- 
cesses. Sociological Methodology 1990, ed. 
CC Clogg Washington: DC. Am. Sociol As- 


soc. 

Jasso G 1993 Choice and emotion in compar- 
ison theory Ration. Soc. 5:231—74 

Jasso G, Rosenzweig MR. 1990. The New Cho- 
sen People: Immgrants in the United States 
New York: Russell Sage Found. 
Kelley D 1972. Why Conservative Churches 
are Growing. New York: Harper & Row 
Kiser E. 1986-1987. The formation of state pol- 
icy 1n western European absolutisms. a com- 
parison of England and France. Polit. Soc. 
15;259--96 

Kiser E, 1994 Markets and hierarchies 1n early 
modern tax systems: a principal-agent anal- 
ysis. Polit. Soc. 22:284~315 

Kiser E, Drass KA, Brustein W. 1995 Ruler 
autonomy and war in early modern western 
Europe. Int Stud. Q. 39:109-38 

Kiser E, Schneider J 1994 Bureaucracy and 
efficiency’ an analysis of taxation ın early 
modern Prussia Am Sociol Rev 59:187-204 

Kiser E, Tong X. 1992 Determinants of the 
amount and type of corruption in state fiscal 
bureaucracies: an analysis of taxation in early 
modern Prussia Comp. Polit. Stud. 25.300— 
3l 

Knoke D. 1990. Political Networks: The Struc- 
tural Perspective. Cambridge: Cambridge 
Univ. Press 

Kollock P. 1994, The emergence of exchange 
structures. an experimental study of uncer- 
tainty, commitment and trust. Am. J. Sociol. 
100:315-45 

Latané B, Darley JM. 1970. The Unresponsive 
Bystander’ Why Doesn't He Help?’ New 
York’ Appleton-Century-Crofts 

Lawler EJ, Yoon J 1993 Power and the emer- 
gence of commitment behavior 1n negotiated 
exchange. Am. Sociol Rev 58°465-81 


SOCIOLOGICAL RATIONAL CHOICE 


Lawler EJ, Yoon J 1996. Commutment in ex- 
change relations. test of a theory of relational 
cohesion. Am. Sociol. Rev. 61°89-108 

Lee BA, Oropesa RS, Kanan JW. 1994, Neigh- 
borhood context and residential mobility De- 
mography 31:249—70 

Lloyd KM, South SJ 1996. Contextual influ- 
ences on young men’s transition to first mar- 
nage Soc. Forces 74:1097—1119 

Logan JA. 1996a. Rational choice and the TSL 
model of occupational opportunity Ration. 
Soc 8207-30 

Logan JA. 1996b. Opportunity and choice 1n so- 
cially structured labor markets. Am J Sociol 
102:114-60 

Lovaglia MJ, Skvoretz J, Willer D, Markovsky 
B. 1995 Negotiated exchanges in social net- 
works Soc. Forces 74.123—55 

Mackie G 1996 Ending footbinding and in- 
fibulation: a convention account Am Sociol 
Rev. 61.999-1017 

MacLeod J. 1995. Ain't No Makin’ It. Áspira- 
tions and Attainment in a Low-Income Neigh- 
borhood Boulder, CO. Westview 

Macy MW. 1993 Social learning and the 
structure of collective action. In Advances 
in Group Processes, ed. EJ Lawler, B 
Markovsky, K Heimer, J O'Brien, pp. 1-35. 
Greenwich, CT JAI 

Macy MW. 1995 PAVLOV and the evolution of 
cooperation an experimental test. Soc. Psy- 
choi Q 58-74-87 

Markovsky B, Skvoretz J, Willer D, Lovagha 
MJ, ErgerJ 1993. The seeds of weak power 
an extension of network exchange theory 
Am. Sociol Rev 58:197—209 

Markovsky B, Willer D, Patton T 1988 Power 
relations in exchange networks Am Sociol 
Rev 53:220-36 

Moen P, Wethington E 1992 The concept of 
family adaptive strategies. Annu. Rev. Sociol 
18 233-51 

Nagin DS, Paternoster R. 1994, Personal capi- 
tal and social control: the deterrence mplica- 
tions of a theory of individual differences ın 
criminal offending. Criminology 32:581—606 

Nee V 1992. Organizational dynamics of mar- 
ket transition. hybrid forms, property nghts, 
and mixed economy in China, Admin. Sci, Q. 
37:1-27 

Nee V, Sanders JM, Sernau S. 1994. Job tran- 
sitions in an immigrant metropolis ethnic 
boundaries and the mixed economy Am So- 
ciol Rev. 59:849—72 

Opp K-D, Voss P, Gern C. 1995 Ongins of a 
Spontaneous Revolution, Ann Arbor. Univ. 
Mich Press 

Orbell J, Dawes RM. 1991. A cognitive miser 
theory of cooperators’ advantage. Am Polit 
Sci. Rev. 85.515-28 

Orbell JM, Dawes RM. 1993. Social welfare, 


213 


cooperators' advantage, and the option of not 
praying the game. Am. Sociol. Rev, 58.787- 
0 


Pampel FC, Gartner R. 1995 Age struc- 
ture, socio-political institutions, and national 
homicide rates. Eur Sociol Rev 11243-60 

Pescosolido BA. 1992, Beyond rational choice. 
the social dynamics of how people seek help 
Am J. Sociol, 97,1096-1138 

Petee TA, Milner TF, Welch MR. 1994. Levels 
of social integration in group contexts and 
the effects of informal sanction threat on de- 
viance. Criminology 32 85—106 

Petersen T 1992. Payment systems and the 
structure of inequality: conceptual issues and 
an analysis of salespersons in department 
stores Am. J Sociol. 98:67--104 

Petersen T 1994 On the promise of game- 
theory in sociology Contemp. Sociol. 
23:498—502 

Pezzin LE. 1995, Barnings prospects, matching 
effects, and the decision to terminate a crim- 
inal career J Quant Criminol. 11:29-50 

PiliavinI, Thornton C, Gartner R, Matsueda RL. 
1986. Crime, deterrence, and rational choice. 
Am Sociol, Rev 51 101-19 

Raftery AE, Hout M 1993. Maximally main- 
tained inequality: expansion, reform, and op- 
portunity in Insh education, 1921-1975. So- 
ciol Educ. 66:41-62 

Raub W, Keren G 1993. Hostages as a commit- 
En device J Econ Behav. Organ. 21:43- 


Risman BJ, Ferree MM. 1995. Making gender 
visible. comment on Coleman. Am, Soctol 
Rev. 60.775—82 

Sanders JM, Nee V. 1996 Immigrant self- 
employment: the family as social capital and 
the value of human capital. Am. Sociol. Rev. 
61:231-49 

Schelling T. 1971. Dynamic models of segrega- 
tion. J Math Sociol 1 143-86 

Sen A. 1983 Economics and the family. Asian 
Dev Rev 1(2):14—26 

Shlapentokh V 1995 Russian patience: a rea- 
sonable behavior and a social strategy Eur. J. 
Sociol. 36 247-80 

Skvoretz J, Lovaglia MJ. 1995 Who exchanges 
with whom: structural determinants of ex- 
change frequency in negotiated exchange net- 
works. Soc. Psychol. Q. 58 163-77 

Skvoretz J, Willer D. 1993. Exclusion and 
power: a test of four theories of power in ex- 
change networks. Am Sociol. Rev. 58 801— 
18 

Smith HL. 1989, Integrating theory and research 
on the institutional determinants of fertility. 
Demography 26.171—84 

South SJ, Lloyd KM 1995 Spousal alterna- 
tives and marital dissolution Am. Sociol Rev. 
60:21-35 


214 HECHTER & KANAZAWA 


Stark R. 1992 Do Catholic societies really ex- 
ist? Ration. Soc. 4 261-71 

Stark R, Bainbridge WS. 1985 The Future of 
Religion. Secularization, Revival, and Cult 
Formation. Berkeley: Univ. Calif. Press 

Stark R, Bainbridge WS 1987 A Theory of Re- 
ligion New York: Peter Lang 

Stark R, lannaccone LR, Finke R. 1996. Reli- 

ion, science, and rationality. Am. Econ. Rev. 
2):433-37 

Thoits PA 1994 Stressors and problem- 
solving: the individual as psychological ac- 
tivist. J. Health Soc Behay 35 143-59 

Treas J 1993. Money in the bank: transaction 
costs and the economic organization of mar- 
nage Am Sociol. Rev, 58:723—34 

Treas J, Giesen D. 1996. Sex, gender, and ratio- 
nal choice: determinants of infidelity among 
married and cohabiting couples. Presented 
at Annu Meet. Am. Sociol. Assoc., Aug 16- 
20, New York 

Udéhn L 1995. The Limits of Public Choice 
London. Routledge 

Walder AG. 1986. Communist Neo-Traditional- 
ism. Berkeley: Univ Calif. Press 

Walder AG. 1992. Property rights and stratifi- 
cation in socialist redistributive economies. 


Am, Sociol, Rev. 57:524—39 

Walder AG 1995 Local governments as 1n- 
dustrial firms: an organizational analysis of 
China's transitional economy. Am J Sociol. 
101:263~301 

Warner RS. 1993. Work in progress toward a 
new paradigm for the sociological study of 
religion in the United States. Am J Sociol. 
98:1044—93 

Williams T. 1989. The Cocaine Kids. The In- 
side Story of a Teenage Drug Ring. Reading, 
MA: Addison-Wesley 

Wu Z 1996 Childbearing in cohabitational re- 
lationships. J. Marriage Fam. 5828192 

Yamagishi T, Cook KS 1993 Generalized ex- 
change and social dilemmas Soc. Psychol Q 
56.235—48 

Yamagish: T, Yamagishi M. 1994, Trust and 
commitment 1n the United States and Japan. 
Motiv. Emot 18:129—66 

Yamaguch: K, Ferguson LR. 1995. The stop- 
ping and spacing of childbirths and their 
birth-history predictors: rational-choice the- 
ory and event-history analysis. Am Sociol 
Rev. 60°272-98 

Young LA 1997. The Rational Choice Theory 
of Religion. New York. Routledge 


Annu Rev Sociol 1997 23 215-3] 
Copyright © 1997 by Annual Reviews Inc All rights reserved 


THE CHANGING ORGANIZATIONAL 
CONTEXT OF PROFESSIONAL WORK 


Kevin T. Leicht 
Department of Sociology, 140W Seashore Hall, University of Iowa, Iowa City, 
Iowa 52242; e-mail: leicht@blue.weeg.uiowa.edu 


Mary L. Fennell 


Department of Sociology, Box 1916 Maxcy Hall, Brown University, Providence, 
Rhode Island 02912 


KEY WORDS professionals, formal organizations, careers, institutional theory, career mobility 


ABSTRACT 

Our paper represents an attempt to stimulate continued sociological interest in 
professional work. Our review suggests that understanding professional work is 
central to understanding larger workplace changes in the late twentieth century. 
Some researchers have documented the increasingly diverse arrangements for 
the delivery of professional services, even as others point to a convergence of 
professional control around accountability and cost containment issues. Claims 
of diverging interests across work settings are often confounded by greater racial, 
gender, and ethnic diversity among professionals. We suggest that the study 
of professional careers provides an avenue for studying the diversification of 
professionals and work settings. Future research should also follow changes in the 
prestige rankings of different organizational arrangements from the viewpoint of 
professionals, professional associations, and clients. Finally we warn researchers 
that managerial groups seeking control of professional work have professional 
agendas of their own which constrain the possibilities for sweeping change. 


INTRODUCTION 


Sociologists’ long-term interest in professional work can be traced to the 
founders of sociology in their discussions of authority (Durkheim 1933, Bendix 
1956), bureaucracy (Weber 1947), and class conflict (Marx 1889). More recent 
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analyses of the nature of professional work and its implications have entered dis- 
cussions of postindustrial class and political systems (Giddens 1994, Inglehart 
1990, Manza et al 1995) and new social movements (Eder 1993, Jenkins & 
Leicht 1996). These larger sociological concerns have mirrored developments 
in the study of the professions themselves. Since Friedson last reviewed the 
changing nature of professional control (Friedson 1984), many developments 
have occurred in the study of professional work. Much of this work is mo- 
tivated by the belief that the relationship between professions and complex 
organizations 1s changing rapidly. 

Our review focuses on the changing nature of professional work and, specif- 
ically, the changing multifaceted relationship between professionals and em- 
ployment settings for professionals. We draw primarily on literature from law, 
medicine, engineering, and science, though our arguments apply to other sec- 
tors as well. We avoid defining precisely what professional work is, in favor 
of examining the role that organizations play in facilitating and placing limits 
on traditional professional work (Friedson 1986). Our review emphasizes the 
important role that professional careers play in examining the changing na- 
ture of professional work. Professional careers are strongly institutionalized 
in that they are shaped through socialization processes and generally accepted 
professional norms defining what “typical” and “distinguished” professional 
careers look like. The actual shape of those careers, however, is increasingly 
conditioned by opportunities within organizations, and the organizations within 
which professional work is done are themselves undergoing dramatic changes 
in both control structures and accountability. 

A focus on the institutionalization of professional careers serves to link the 
study of professional control to organizational theory. Employers of profes- 
sionals must contend with institutional norms about professional careers when 
they tap into the labor market for professionals. Organizations under specific 
environmental pressures shape reactions to pre-existing, institutionalized pro- 
fessional norms. In addition to the usual responses to institutional pressure 
(isomorphism——see Scott 1993), organizations may also attempt to shape pro- 
fessional norms, directly confront professional norms, or engage in avoidance 
of professional norms (Baum & Oliver 1991, Oliver 1991). 

The next section discusses the growing variation in settings where profes- 
sional work takes place and the implications this diversity has for notions of 
professional careers. This is followed by a discussion of the possible conver- 
gence of institutional practices around a singular model of professional careers. 
We also examine diversification of interests among professionals, fragmenta- 
tion of professional interests, and the fragmentation of work organizations more 
generally, with an emphasis on the growing professionalization of managerial 
work. Finally, we outline a future research program on professional careers that 
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takes these new developments into account, and we briefly consider method- 
ological issues pertinent to such research. 


THE INCREASING DIVERSITY OF PROFESSIONAL 
WORK SETTINGS 


Over a decade ago in Volume 10 of this series, Friedson argued that the nature 
of professional control has changed from that of a fairly informal collegial re- 
lationship among peers to more formalized relationships that have segmented 
professionals within fields along a number of dimensions. Chief among them 
1s the stratification of professionals into rank-and-file practitioners and super- 
visory elites. Although the exercise of bureaucratic controls by professionals 
over professionals has been well documented for many years across most of the 
traditional professions (Goode 1957, Scott 1982, Nelson 1988), we suspect that 
the locus of that control has begun to shift. Scott’s three basic models (1982) 
for embedding professionals within organizational settings—autonomous, het- 
eronomous, and conjoint—were still based on the assumption that professionals 
control professionals, even in bureaucratic settings. 

Now, however, the range of organizational settings in which professionals 
work has far exceeded earlier models of the solo practice, the partnership, 
group practice, large law firm or the large modern hospital. Professionals are 
found as salaried employees of for-profit and nonprofit organizations, of govern- 
ment units of all sorts, of freestanding nongovernment organizations (NGOs), 
endowed research organizations, universities, foundations, and corporations 
(Derber & Schwartz 1991, Abbott 1988, 1991). In many if not all of these set- 
tings, the control of professional work is no longer necessarily vested in peers, 
or even in the administrative elite of the profession; hierarchical control over 
professional work is often vested in professional managers of the employing 
organization. Some argue that the control of professional work by nonpro- 
fessionals extends to decisions concerning compensation (as in corporate law 
firms, see Tolbert & Stern 1991), and even to the the evaluation of professional 
performance (as in the establishment of effectiveness research-based protocols 
in medicine for performance review by third-party payers; see Hafferty & Light 
1995). 

For example, one of the fastest growing segments of the legal labor mar- 
ket is that of in-house legal departments of large corporations (Curran 1986, 
Halliday 1987, Leicht & Fennell 1996). Little is known about the structure 
of in-house legal departments, other than that new female law school gradu- 
ates are increasingly hired for such positions (Roach 1990) and that differential 
employment patterns exist for men and women lawyers in corporate legal de- 
partments. In medicine, a parallel expansion is underway in the rise of managed 
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care organization (MCOs), which contractually link physicians to third-party 
payers as either employees (as in HMOs) or contracted providers (as in pre- 
ferred provider organizations). In both types of MCOs, medical work and 
clinical decision-making are directly subject to control of the MCO (Dill 1995, 
Mechanic 1994). And both law and medicine exhibit trends toward incorpora- 
tion of the worksite into large, multiorganizational firms, either regionally, na- 
tionally, or internationally dispersed (Fennell & Alexander 1993, Stevens 1989). 

Given this expansion in the types of organizational settings in which profes- 
sional work is done, a plethora of unanswered questions remains concerning the 
career mobility of professionals either within or between these various settings. 
Traditional notions of which professional work settings are the most valued, 
prestigious, or well compensated are no longer reliable. The hallowed settings 
of the university-based teaching hospital, the private law firm, or the tenured 
faculty position have slipped on dimensions of stability, security, and autonomy 
(compensation has always been problematic in academia). Within different pro- 
fessions, though, we have no firm handle on the extent to which the relative 
prestige ranking of various settings has changed or is changing (Halliday 1987, 
Fligstein 1990, Hagan 1990), or to what extent newly developed or profoundly 
changing practice settings fit into those rankings (Leicht et al 1995), Although 
physicians at first resisted participating in MCOs, it would appear that now even 
the most esoteric specialists have decided to join the trend, many assuming that 
"the age of managed care" is inescapable. Given policy and insurer emphasis 
on primary care physicians as gatekeepers in the managed care system, how 
will specialty rankings within medicine change as a result of the uneven dis- 
tribution of referral power? Recent data suggest an increase in the number of 
medical students preferring primary care over specialization (Kassebaum et al 
1995), which may lead to long-term changes in the specialty structure within 
medicine. In corporate law, the traditional long-standing relationship between 
corporate customer and law firm is now being jettisoned in favor of either more 
frequent “spot contracting" with multiple firms, or the development of inter- 
nal legal departments (Tolbert & Stern 1991). Again, how have these changes 
affected status rankings within the legal profession? We also do not know to 
what extent significant mobility has occurred across diverse work settings, or 
whether mobility across settings is more valuable to the individual career (in 
terms of either earnings or prestige gains) than mobility within settings. We 
return to these issues in a later section of this paper. 


THE CONVERGENCE OF CONTROL STRUCTURES 


Even as the settings of professional work have become more diverse, the struc- 
tures and bases of control over professional work appear to be converging, both 
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across and within professions and across work settings. This would be predicted 
by institutional theory as the expected result of isomorphic adaptation over time. 
Institutional isomorphism is a process whereby groups of organizations move 
toward similar organizational forms and practices over time as they compete 
for political power and institutional legitimacy. DiMaggio & Powell (1983) 
outlined three mechanisms through which isomorphic change occurs: coercive 
pressure from governing bodies or the state; mimetic pressure on organizations 
to copy existing forms when faced by conditions of great uncertainty; and the 
normative expectations of powerful professional groups within organizations. 
However, when the norms are themselves in flux, and the power of the pro- 
fession becomes unstable, the balance may shift toward coercive and mimetic 
pressure, both of which may be more influential when uncertainty in a sector 
or organizational field is high (Fennell & Alexander 1987, Derber & Schwartz 
1991). Friedson’s work (1994) suggests another basis for the "new profession- 
alism" is control through expertise that 1s blended with an openness to inspec- 
tion/evaluation by both peers and external parties. We suspect such a model 
may be possible in the future, but at present indications are that control over 
professional work is slipping firmly into the realm of corporate and bureaucratic 
control, as Starr predicted for medicine some time ago (1982). Specifically, pro- 
fessional practice in many Sectors is increasingly driven by pressure for revenue 
generation and accountability to the state and to avoid legal prosecution. 
Examples can be found in many sectors. In medicine, the primary "norms" 
and emphases of the sector have shifted from access and quality issues to 
cost containment and service reduction (Alexander & D’Aunno 1990, Flood 
& Fennell 1995, Prechel & Gupman 1995). Clinical practice guidelines have 
been developed and used across subspecialties to standardize clinical practice 
and by managed care plans to deny reimbursement and shape cost-effective ser- 
vice delivery (Hafferty & Light 1995). Lawyers are pressed to build up stables 
of clients and maximize billable hours (Nelson 1988, Wallace 1995), while 
the decision-making of the judiciary is shaped by sentencing guidelines and 
tort law (Tonry 1993, Stolzenberg & D' Alessio 1994, Kramer & Ulmer 1996). 
Engineers are increasingly pressed to demonstrate the cost-effectiveness of their 
designs (Perrucci 1971, Whalley 1991), and scientists in both the private sector 
and within universities are expected to generate research grants as well as follow 
university and federal guidelines on the use of human subjects, data security, 
and reporting procedures (Long 1992, Long et al 1993, Gieryn 1983). Since 
the mid-1980s, university and college curricula have been modeled on guide- 
lines issued by the American Association of Colleges, and quality assessment 
in higher education emerged from the curriculum reform movement (Ratcliff 
1996, Banta & Associates 1993). In all of these examples, practice guidelines 
have both shaped the content of professional decision-making and provided a 
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vehicle through which accountability for professional practice can be explic- 
itly measured. Evaluation of professional performance in medicine, judicial 
sentencing, and academia have used compliance with practice guidelines as an 
important indicator of appropriate or high quality work. 

In many ways, this change in control within professions reflects changes 
discussed by Friedson (1984) and Hafferty & McKinlay (1993) concerning the 
impact of external threats to professional autonomy. Those threats impinge on 
multiple levels of professional work, from the terms of work to the conditions 
and content of work. For example, within medicine the intrusion of managed 
care has interjected an additional layer of authority between the provider and 
the patient in such a way that the referral to a specialist is no longer simply the 
decision of the primary care physician, since the choice of specialist and the 
treatment by that specialist is often constrained by contract relationships and 
reimbursement policies. Thus, the terms and conditions of medical work have 
changed due to structural changes in health care organizations. The content of 
medical work is then constrained by the use of clinical practice guidelines by 
HMOs and MCOs to structure clinical decision-making. Guidelines have been 
incorporated within managed care settings as part of their adoption of "total" or 
"continuous quality improvement" (CQT) methods to assess quality and control 
costs (Burns et al 1992, AHCPR 1995). Physician practice profiles are routinely 
constructed from chart audits and compared to guideline recommended practice 
patterns. Physicians who have aberrant profiles (usually over-usage of costly 
tests or procedures) are then encouraged to adjust their practice behaviors to 
better comply with the organization's preferred profile. Thus we find direct 
incursions on medicine's control over its core technology (Hafferty & Light 
1995). 

The popularity of the CQI movement in health care is itself an interesting 
organizational phenomena, which suggests the importance of mimetic isomor- 
phic processes of change. Very little evidence is available to justify claims of 
effectiveness of CQI in medical settings; nonetheless the Joint Commission on 
Accreditation of Healthcare Organizations (JCAHO) has rewritten its accredi- 
tation procedures on quality to promote and endorse the use of CQI in hospitals 
(JCAHO 1992). Hospitals and clinics have rushed to train their members in 
the language and methods of CQI, following the lead of both JCAHO and 
each other (mimetic isomorphism), in order to benchmark their organization's 
performance against the rest of the industry (Flood & Fennell 1995). 

Medicine is not alone in its acquiescence to external pressures for account- 
ability and cost containment. Universities are subject to severe financial pres- 
sures, as federal support for tuition grants and research funding has steadily 
declined. Rewards within the university are increasingly distributed on the ba- 
sis of revenue generation. Promotions, prestigious chairs, and salary increases 
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are given on the basis of a mix of criteria in which innovative scientific research 
that advances knowledge is still valued but is increasingly displaced by an em- 
phasis on scientific research that is funded or that generates patents (Bok 1993, 
Long 1992, Long et al 1993). At the same time, many academic disciplines in 
both the sciences and the liberal arts are subject to increasingly difficult pres- 
sures to retain talented faculty in the face of limited and dwindling rewards. 
The private sector has expanded its search for top quality professionals and re- 
presents an increasingly valued alternative career track for academics (Abbott 
1991), 


DIVERSIFICATION OF INTERESTS AMONG 
PROFESSIONALS 


The diversification of the locations of professional work and the increasing 
pressure to generate revenue and control costs have led to questions about the 
diversification of interests among professionals and the overall state of the pro- 
fessions generally. The debates surrounding these changes are often couched 
in terms of professionalization, deprofessionalization, and (in some cases) pro- 
letarianization (Derber & Schwartz 1991). Debates about professionalization 
and deprofessionalization often fail to separate the content of professional work 
from the context where the work is performed (see Abbott 1991). Further, the 
relationship between diversification of professional interests and deprofession- 
alization is usually left unspecified. 

Evidence for the diversification of professional interests is fragmentary. 
Researchers usually point to the growing racial, ethnic, and gender diversity 
of recent cohorts as a sign of the diversification of professional interests (see 
Menkel-Meadow 1989 and Epstein 1993 on law, Zuckerman et al 1991 on 
science, Chamberlain 1988 on academia, Morrison & Von Glinow 1990 on 
management, and Bonner 1992 on medicine). Other research points to well- 
publicized disputes within professional associations or challenges to their gov- 
ernance of professions (Hafferty & Light 1995, Halliday 1987). Relatively 
little research asks professionals from different social backgrounds working in 
different organizational contexts what they view as the nature of their profes- 
sional role or (perhaps more importantly) how they view their professional roles 
as they are carried out by professionals in other organizational contexts. Evi- 
dence regarding the effects of cost containment on professional divisiveness is 
relatively fragmentary as well, with most of the evidence confined to medicine 
(Hafferty & McKinley 1993, Leicht et al 1995). In theory, uniform pressures 
for cost containment should produce relatively uniform responses that miti- 
gate the effects of diversified professional interests on professional practice. 
However, there is still relatively little research that examines the effects of cost 
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containment on the content and context of professional work (but see Clark & 
Estes 1992, Feder et al 1987, Flood et al 1994, Prechel & Gupman 1995). 

Researchers' attempts to study the consequences of diversification and cost 
containment are often confounded by the diversification of professional work 
settings. In addition, itis often notclear which professional workers are affected 
and which workers were in fact the targets of systematic organizational changes 
(Leicht et al 1995). We suspect this is partly due to the fact that these changes 
have mostly affected individual practitioners and either (a) have not affected 
the content and context of work for other, more elite professionals at all, or (5) 
have enhanced the autonomy and responsibility of relatively elite professionals 
by placing them in control of cost containment initiatives. Further, almost no 
research has examined whether these social and organizational changes affect 
the content of professional practice or merely the context where professional 
work takes place (excepting Hoff & McCaffery 1996, Abbott 1988, 1991). 

To date, extantresearch has been imprecise in isolating these differences in di- 
versification effects and clarifying their relationship to different work settings. 
To what extent is interest-diversity the result of changes 1n the demographic 
composition of professional workers over time versus changes in organizational 
arrangements? To what extent do the two overlap? Has the growing demo- 
graphic diversity of professionals led to different "strategies of resistance" for 
different professionals facing threats to professional prerogatives? Some evi- 
dence suggests that diversification in professional work settings should lead to 
diversification of professional interests (see Abbott 1991, Hoff & McCaffery 
1996). But this effect could be confounded by a diversity of responses to the 
same organizational arrangements as well. In the last section we outline a 
research program to partially address these issues. 


CAREERS AND CAREER MOBILITY 
AMONG PROFESSIONALS 


Professional careers provide one linchpin for studying professional diversifi- 
cation and the organizational context of professional work. It is through the 
professional career that the benefits of professional life (status and income) are 
accrued. Career paths trace the training outcomes (degrees and location) and 
job transitions of individuals over their work lives. The professional career path 
represents the observable history of how individuals have been socialized into 
appropriate professional roles (Kagan & Nelson 1985, Nelson 1988, Nelson 
et al 1992, Powell 1985, Rosen 1989). The typical professional career path 
is likely to change if diversity in organizational arrangements leads to diver- 
sity of interests and demographic diversity among professional workers (see 
Kunda 1992). Further, it is by changing career outcomes that the interests of 
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professionals are likely to diverge. Current and future research should draw on 
the study of professional careers as an arena in which to examine professional 
diversification and fragmentation. 

One question raised by research on diverse professional settings is whether 
the labor market for professionals is segmented by either gender or function 
(Hagan 1990, Wallace 1993, Long 1992). Gender segmentation has been most 
widely publicized in popular debates about the “mommy track" in the legal 
profession and the glass ceiling in management. Both are structural barriers 
that act to depress mobility among professional women. In addition, most 
professions have been conceptually dichotomized by function into elites and 
practitioners. Elites represent a small percentage of the knowledge producers 
for the profession and are usually housed in universities, whereas non-elite 
practitioners consume and apply the knowledge produced and disseminated 
by elites (Friedson 1994). This difference has not been a major cleavage in 
most professions because elite members were responsible for the training of 
practitioners, elites depended on strong demand for and the relatively high status 
of practitioners in the wider society to bolster thetr own social status, and elites 
were vastly outnumbered by practitioners and shared similar incomes and social 
statuses (Abel 1989, Stinchcombe 1990). However, the widening diversity 
of professional work settings suggests that this relatively stable conception of 
professional stratification will (and has begun to) change. With this change will 
come an array of new questions about professional careers and stratification that 
were not salient for most professions uniil as recently as ten years ago. 

The first and most obvious question one could ask as organizational arrange- 
ments diversify is whether there are systematic differences in those recruited 
into professional roles in different settings. Governments and universities tra- 
ditionally have been the major avenues for career advancement for women 
and minorities in diversifying labor markets. There is considerable pressure 
placed on these work settings to conform to equal employment opportunity 
laws (Powell & Dimaggio 1991, Bridges & Villemez 1994). Within law, the 
growing use of in-house counsel has opened a mobility alternative to private, 
independent legal practice in law firms (Spangler 1986, Epstein 1993). In 
medicine, the rise of HMOs, PPOs, and other group practice arrangements has 
made salaried work a rising proportion of medical practice (Hafferty & Light 
1995). A key question to be addressed by future research is whether profes- 
sionals with different characteristics are more likely to end up in some specific 
organizational contexts and not in others. 

A second, related question is whether there is significant and typical career 
mobility across diverse work settings and (if so) what the consequences are 
for individual professionals who move between settings. This may be a highly 
significant indicator of fragmentation and segmentation in professional work. 
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If organizational contexts constitute segmented, separate labor markets that 
people enter into early in their careers and rarely (if ever) leave, this would 
signify that professional work was becoming fragmented. Alternatively, if the 
boundaries between organizational settings are relatively permeable and show 
no consistent pattern of career movements, this would suggest that professionals 
have been able to maintain a relatively coherent occupational labor market tied to 
generalist knowledge. We think a more likely pattern is one of path dependence, 
where there are systematic movements between work settings and where the 
prediction of the destination work setting depends critically on the immediately 
prior work setting. 

A further extension of this question is whether mobility across organizational 
contexts is more valuable for a professional career than mobility within specific 
contexts. This question is complicated by the problem that many new orga- 
nizational arrangements for professional work (for example, storefront legal 
service providers) do not have well-established professional and career stratifi- 
cation patterns, and the demographic and social backgrounds of professionals 
in both new and older established work settings are changing. 

À related concern involves the relative prestige rankings of different organi- 
zational arrangements. Traditional research on the relative prestige rankings of 
professional settings assumes that independent, fee-for-service practice is the 
standard by which all organizational arrangements are judged. However, nu- 
merous professional groups (such as accountants and engineers) have never es- 
tablished clear norms concerning organizational arrangements for professional 
work. Many new arrangements do not have clear counterparts that predate 
their appearance in the 1970s and 1980s, and this makes overtime comparisons 
of status rankings difficult. Further, it is not clear that younger professionals 
are as committed to long-established professional practice norms, especially in 
law and medicine (Halliday 1987). These problems could be partially (but not 
completely) overcome by surveying professionals at different stages of their 
careers, including older professionals whose entry into professional life pre- 
dates many of the new organizational arrangements. On a related issue that is 
critical for the overall evaluation of professional work, status, and privilege, we 
know little about the relative rankings of organizational contexts by clients of 
professionals, outside of public opinion surveys abort HMOs (Strang 1995). 

A second key to understanding the organizational context of professional 
work is to engage in systematic comparisons of the career mobility of profes- 
sionals and others who work in the same organizatioaal settings. In situattons 
where organizations are setting the agenda for the coniext of professional work, 
there should be divergent mobility patterns in different organizational settings. 
However, this is only part of the comparison that is needed. Researchers should 
also answer the question of whether the career patterns of professionals are 
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distinctive. If there is considerable isomorphic pressure from employing orga- 
nizations, then professional career mobility will look like the career mobility of 
other white-collar employees. It also suggests that professional norms are not 
well established or have been subordinated (Kunda 1992). As Abbott (1988) 
has argued, the focus on isomorphic pressures from organizations as well as 
professions means that researchers should study multiple professional groups 
interacting with each other and with nonprofessional groups in the same orga- 
nizational settings. 


A FUTURE RESEARCH PROGRAM ON THE 
DIVERSIFICATION OF PROFESSIONAL WORK 


Our review of the existing literature on organizational contexts and profes- 
sional work suggests that there are many exciting questions in need of research. 
Further, there is a wide array of documentary data sources on professionals 
that have yet to be fully exploited. Much of this documentary data comes from 
professional directories that provide historical information on individual profes- 
sionals (the legal profession has the richest information sources in this instance). 
Researchers will want to engage in extensive interactions with professional as- 
sociations in order to gain access to association documents, newspapers, media 
releases, and reactions to new organizational arrangements. Almost all of our 
future research suggestions draw on organizational theory and especially on 
institutional theory (see Scott 1995, Powell & DiMaggio 1991) and focus on 
the study of institutional arrangements, careers and mobility, and job content. 


Changing Institutional Arrangements 


At the institutional level, work is needed on the rise and decline of organizational 
arrangements. In some professions, established organizational arrangements 
for delivering services have carried considerable institutional weight. In fact, 
the prestige of a profession is often dictated by the ability of professionals to 
determine the organizational form under which service will be delivered. If 
medicine represents one extreme where (traditionally) there are strong insti- 
tutionalized norms dictating appropriate organizational forms for professional 
practice, engineering may be at the other extreme (compare Starr 1982 to Kunda 
1992). 

One fruitful avenue of future research at the institutional level would be to 
study the rise and decline of organizational arrangements using the theory and 
methods of organizational ecology (Hannan & Freeman 1989). Most new de- 
velopments in the organization of professional work in the United States have 
occurred in the twentieth century, and the history of new professional organi- 
zational forms is usually well documented as competitors seek to legitimize 
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new organizational arrangements through appeals to the state, consumers, and 
(ultimately) practitioners. Further, the interest groups that develop around new 
organizational arrangements often create support grcups and professional asso- 
ciations of their own to monitor and support the growth of the new organizational 
form. These interest groups and professional associations could be arich source 
of new data on the rise and decline of organizational forms. 

In addition to examining the usual processes that organizational ecology 
draws attention to (legitimation and competition), researchers will want to pay 
particular attention to newer processes discussed in the interaction between or- 
ganizational populations—competition, complementarity, and path dependence 
(see Carroll & Harrison 1994). Some organizational forms compete directly 
against each other for the same service niche. Other forms act in synergy with 
each other, aiding the legitimation and growth of each organizational form in 
itself. Further, the success of some organizational forms will depend critically 
on their start-up date relative to the existence and establishment of other organi- 
zational forms. To date, organizational ecology has been used to explain the rise 
and decline of HMOs (Strang 1995) and funeral directors (Torres 1991) but has 
not been applied to the development of other organizational forms in different 
professional settings (for a slightly different treatment of state bar associations, 
see Halliday et al 1993). 

At the microlevel, researchers should address questions of professional, 
client, and professional association acceptance of n2w organizational forms. 
This aspect of future research could employ survey and field research method- 
ologies along with comparative historical analysis. 

Quantitative survey research methodology could be used to study the ac- 
ceptance of new organizational arrangements by clients. In addition to its 
amenability to random sample survey research techn:ques, this research prob- 
lem could be addressed with in-depth interviews of corporate clients on the 
verge of adopting new organizational arrangements for the purchase of pro- 
fessional services, or clients who have made divergent choices ın the forms 
of professional services they consume. For example, we still have relatively 
little information about the decision-making processes leading to the corporate 
adoption of HMOs and employee reactions to them. The delivery of some pro- 
fessional services (accounting is the most obvious example) can be radically 
changed by the use of information technologies, yet the arrangements for deliv- 
ering services in this way often violate the institutional expectations of clients 
(Fischer & Dirsmith 1995). Survey and field research on actual client decision- 
making is needed in addition to research on the general ranking of different 
organizational arrangements for professional work from clients' perspectives. 

Researchers could also focus their attention in the field and in surveys on 
professional acceptance of organizational arrangemerts. As stated above, this 
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type of research needs to be sensitive to likely cohort differences in evaluations 
and exposure to new organizational forms. Again, sample surveys providing 
general rankings of different organizational arrangements are needed in addition 
to field research on reactions to (and rationales for) specific organizational 
arrangements. 

Finally, researchers could focus on professional associations (both dominant 
and dissident) and their reactions to changing organizational arrangements. 
Much of this work would involve comparative historical analyses of docu- 
ments, newspapers, press releases, convention minutes, and other literature 
produced by professional associations. These projects should focus not only 
on the association’s reaction to external developments, but also on the internal 
debates and reactions to conflict over new organizational arrangements and the 
constellations of interests involved. 


New Research on Professional Careers and Mobility 


Another needed track within this research program should focus on professional 
careers, using institutional theory (Leicht et al 1995) or prior research on careers 
in other settings (Rosenbaum 1984, DiPrete 1989, Diprete & Grusky 1990). 
Many directories of professionals allow for the study of cohorts of professionals 
as they enter and leave the labor market (see for example the Martindale-Hubbell 
Law Directory or the Law and Business Directory of Corporate Counsel, various 
years). More systematic methodological work in conjunction with professional 
associations could help to gauge the accuracy and coverage embodied in these 
directories so that methodological corrections can be made (where appropriate). 
Initially, researchers will probably do well to collect more systematic career 
information on a single organizational arrangement, putting studies together 
for comparative analysis later on. 

Ultimately we think that researchers should turn their attention toward sys- 
tematic comparisons of multiple organizational settings for a single existing 
profession. Key questions that studies of this kind could address include: 1. Do 
organizational settings draw professionals from different backgrounds? 2. Do 
career mobility patterns within and across settings lead to the clustering of 
elite practitioners? 3. How do the career patterns of professionals differ across 
organizational settings? 4. Are there systematic patterns of mobility across 
organizational settings, and 1f so, what are the consequences of that mobility? 

There are at least two other avenues for new research that need to be men- 
tioned. The growing heterogeneity of the settings where professional practice 
takes place provides an ideal opportunity to compare the careers of different 
professionals to each other and to those of other white-collar workers, especially 
managers (see Abbott 1988 for an example). One could argue that the latest 
rounds of corporate downsizing represent an attempt by managers to assume 
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professional status by substituting bureaucratic hierarchies with loosely cou- 
pled subcontracting and consulting arrangements (Boyett & Conn 1992), The 
study of professional careers must ultimately recognize that employment ar- 
rangements are changing rapidly for other white-collar workers as well. 

Most prior work on changing organizational arrangements ultimately men- 
tions (or at least implies) that the job content of professional work has changed. 
This is an area where many good field studies already exist (Abbott 1988, Kunda 
1992, Thomas 1995). Future field research should track job content changes 
very carefully, noting the progression of changes in organizational arrange- 
ments and following technological innovations and their actual use (see Barley 
& Kunda 1992, Rule & Attewell 1989). In addition to exhaustive field work 
on the actual content of job tasks across settings, we need research that con- 
ducts systematic interviews with various professionals to assess their reaction 
to changes in job content. 


The Professionalization of Everyone? 


In 1964, Wilensky asked the question whether virtually all occupations would 
or could become professionalized (Wilensky 1964). With corporate downsizing 
at the middle-management level and the rapid movement toward subcontracting 
arrangements, outsourcing, organizational work teams, and other nonbureau- 
cratically oriented work arrangements, we would argue that the organizational 
form in which many white-collar services will be delivered will at least super- 
ficially resemble traditional professional practice. 

Ultimately, the changing organization of professional work is complicated 
by other features of contemporary work organizaticns. Managers and those 
assigned the duties of administrative control in work organizations are also en- 
gaged in professional projects (Barley & Kunda 1997, Marglin 1974), and part 
of these projects involve the creation of workplace norms concerning autonomy 
and accountability that are similar to those enjoyed “or allegedly enjoyed) by 
professionals. The relationship between professional workers and employing 
organizations is complicated by the (real or alleged) decline of bureaucracy as an 
organizing principle of work organizations (see Boyett & Conn 1992). It is no 
longer true (if itever was) that managers are oriented toward control of their em- 
ployees. Instead, we argue that managers are orientec toward maximizing their 
own autonomy and that this explains their behavior nore readily than control 
orientations, especially with regard to managing professional employees. At 
a minimum, researchers addressing the relationship tetween professionals and 
organizational arrangements need to be sensitive to broader changes in work- 
place organization that go beyond the traditional emphasis on professionals in 
bureaucracies. 

Researchers interested in the changing organizational context of professional 
work face an exciting, new, and complicated empirical world for their research. 
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Our review has only touched the surface of the many issues with which re- 
searchers in this area are currently struggling. We hope that our review will 
stimulate new generations of researchers to enter an area of study that represents 
a unique interface between organizational theory, social stratification, and the 
historical analysis of institutional change in professional work. 
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ABSTRACT 

The measurement of age, age structuring, and the life course has become more 
problematic as the study of human lives has moved toward more detailed analyses 
and explanations. As we seek to better understand the course of human lives in 
contemporary and changing societies, the effective empirical measurement of its 
key concepts simultaneously becomes more pressing and more complicated. We 
first review the critical concepts of, and measurement strategies associated with, 
age and age structuring—including a discussion of different types of age, subjec- 
tive age identification, age norms and age expectations, critical life events, life 
phases, and life review. We then discuss state-of-the-art methods for measuring 
the life course, especially through life history and event matrices, and we close 
the chapter with some comments on the organization, analysis, and modeling of 
data. 


INTRODUCTION 


Within the social sciences, the measurement of age, age structuring, and the life 
course must be considered within the context of three very different debates and 
problem areas. First, it is related to the more general problem of how the lives 
of individuals are connected to the development and change of larger social 


233 
0360-0572/97/08 15-0233$08.00 


234 SETTERSTEN & MAYER 


collectives, of societies as a whole, and to the course of history (Elder 1996, 
Mayer 1986). Second, it is related to the interaction of biological, psychologi- 
cal, and social aspects of individual development (Baltes et al 1996). And third, 
it is related to a lively debate on the social construction of time (Nowotny 1989). 
We must limit our discussion to a select number or issues within these three 
areas, guided primarily by decisions about the impcrtance of certain concepts 
and measures for empirical research. 

The measurement of age, age structuring, and the life course has become more 
problematic as the study of human lives has moved away from global images and 
theoretical categories toward more detailed analyses and explanations. Over 
the last century, everyday ideas about what constitutz2s the "normal biography" 
have become less clear. While we have little problem conjuring up images 
about the life of a Yorkshire coal miner or Iowa farmer, our 1mages about the 
majority of lives in contemporary society seem to lack the same degree of clarity. 
Our scientific treatment of the life course must allow for the heterogeneity, 
discontinuity, and contingency that exists in present-day societies. 

Earlier notions of the life cycle, life span, or life course were at least in prin- 
ciple based on holistic conceptions of human lives (O'Rand & Krecker 1990). 
The dominant theme was borrowed from biology: maturation and growth, fol- 
lowed by decline and regression. Only as a minor subtopic did the idea of 
lifelong development, whether actual or potential, surface (Baltes et al 1996). 
As the study of human lives became more elaborate, sociological treatments 
were differentiated from those of biology and psychology; and even sociolog- 
ical treatments were segmented into a number of specialties (e.g. sociology of 
youth, sociology of old age, educational and occupational sociology, demog- 
raphy, and family studies). The challenge now lies in moving away from this 
fragmentation to a truly integrative study of the life zourse. 

Chronological age, a property of individuals, may stand as a proxy for bi- 
ological maturation, psychological development, membership ın larger social 
categories (e.g. cohort), or life stage or phase. Age structuring refers to the fact 
that every society uses age in important ways, and the experiences, roles, and 
statuses of individuals are often tied to age (Kertzer 1989). And the concept 
of the life course refers to the way in which social irstitutions shape and insti- 
tutionalize individual lives in the interconnected domains of education, family, 
and work (Mayer & Tuma 1990). Thus, educational trajectories, employment 
histories and occupational careers, and family and residential histories in part 
compose the life course. The life course might be understood as an objective 
“biography,” though the term biography has increasingly been used in a more 
narrow sense, and is usually understood to be the sudjective interpretation and 
construction of the life course by individuals themselves (Bertaux & Kohli 
1984). 
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While birth and death dates mark an individual’s historical presence, the 
concept of cohort is reserved for an aggregate of individuals anchored together 
in historical time (normally defined on the basis of birth year). It is ultimately 
used as a basis for searching for common collective properties (or “cohort 
effects”) brought about by the fact that historical events and conditions may 
impinge similarly on a cohort as they grow up, grow old, and die. In this 
chapter, unfortunately, we are unable to treat the problems associated with 
conceptualizing and measuring cohort. For a general introduction, interested 
readers might consult Ryder’s (1965) classic paper on the concept of cohort, 
and Rosow’s (1978) classic paper on the measurement of cohort. 

So while the life course can be viewed as an event history of a single indi- 
vidual, it can also be viewed at aggregate level (e.g. as something shared by 
a cohort), as a property of cultures themselves, and as something that can be 
compared across historical periods or between nation-states. 

We first review the critical concepts of, and measurement strategies associated 
with, age and age structuring—including a discussion of different types of 
age, subjective age identification, age norms and age expectations, critical life 
events, life phases, and life review. We then discuss state-of-the-art methods 
for measuring the life course, especially through life history and event matrices, 
and we close the chapter with some comments on the organization, analysis, 
and modeling of data. 


MEASURING AGE AND FORMS 
OF AGE STRUCTURING 


Age is one of the most primary social and cultural categories. Age usually op- 
erates interdependently with sex, and both age and sex are universal categories 
(LaFontaine 1978, Linton 1942, Parsons 1942). Age and sex serve as important 
dimensions 1n both cognition and social structure. Cognitively, they function 
as convenient categories for the mapping of social and cultural expectations 
about life experiences and roles. The categories of age and sex are also of im- 
portance to the individual personality, particularly where self-identification and 
self-perceptions are concerned. In social structure, the categories of age and 
sex are common dimensions by which institutions are organized (e.g. aspects 
of our familial, educational, and occupational institutions are organized by age 
and sex). 

As we mentioned earlier, age structuring refers to the fact that every soci- 
ety "has a distinctive way of dealing with age and, accordingly, that roles are 
in some ways age-linked" (Kertzer 1989, p. 5). Age structuring can be for- 
mal, at the level of social structure and social institutions, where key concepts 
have 1ncluded "cohorts, roles, and age strata, and their relationship to political, 
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economic, kinship, and other social systems and historical changes in these 
systems” (Kertzer 1989, p. 6). Age structuring cen also be informal, where 
key concerns have been how individuals go about cividing the life course into 
meaningful segments, what kinds of behavior are considered appropriate for 
individuals of different ages, and what notions exist about the proper timing 
and sequencing of life events and transitions. 


AGB STRUCTURING AND GENDER Most researchers have assumed that age 
structuring, whether formal or informal, may be quite different for men's and 
women’s lives, given that “the very division by gerder is in itself conditioned 
by age structuring, with the salience of gender distinctions dependent on age" 
(Kertzer 1989, p. 12). Similarly, Hagestad (1991) has noted that anthropologi- 
cal work on transitions has suggested that men and women may attach different 
social meanings to age, and that men and women may use different guidelines 
to measure the progress of their lives. For example, a long anthropological 
tradition focuses on rites de passage—Trites that structure movement through 
lifetime by age and sex, and provide a means by which roles are assigned and 
resources are allocated (Benedict 1938, Glaser & Strauss 1974, van Gennup 
1908). Given that men's and women's lives are often thought about and expe- 
rienced differently, our measures must be sensitive to these differences. 


AGE STRUCTURING ALONG OTHER SOCIAL DIMENSIONS  Population-level be- 
havioral patterns often show that the way the life course is actually experienced 
goes hand-in-hand with social location. The most prominent social dimensions 
in the literature on the demography of life-course events and transitions include 
sex (as discussed above), cohort/age, race/ethnicity, and social class (education, 
occupation, income, or some combination). While most scholars recognize that 
multiple definitions of the life course may emerge along various social dimen- 
sions, little empirical work has examined this variability, particularly at the 
level of subjective experience (Fry 1990). As a result, researchers should con- 
tinue to describe and explain these multiple definitions and experiences, and 
our measures must allow for these complexities. 


AGE STRUCTURING AND LIFE SPHERES In most Western societies, many insti- 
tutions are formally organized by age to some degree. For example, primary and 
secondary educational institutions are heavily age-graded (Angus et al 1988). 
Similarly, work institutions often structure prospects for promotion by age and 
seniority (Lashbrook 1996, Lawrence 1996, Rosenbaum 1984), and retirement 
policies and benefits are often structured around zge (Henretta 1994, Kohli 
1994). 

The degree of both formal and informal age structuring may vary by life 
sphere. For example, it has been argued that age is more salient in the economic 
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and political spheres than it is in the family sphere (for further explication of 
this debate, see Settersten & Hagestad 1996b). Nonetheless, because lives are 
lived in multiple spheres, and because these spheres are often experienced in 
dramatically different ways, our measures must be sensitive to these differences. 


AGE STRUCTURING ACROSS CULTURES Project A.G.E., a cross-cultural rese- 
arch project headed by anthropologists Fry and Keith, is devoted to studying the 
meaning of age and aging in a variety of cultures, including seven communities 
on four continents (Hong Kong; Swarthmore, Pennsylvania; Momence, Illinois; 
Blessington, Ireland; Clifden, Ireland; the pastoral Herero of Botswana, South 
Africa; and the !Kung Bushmen, of Botswana, North Africa), Project A.G.E. 
researchers have found that chronological age has the strongest salience in 
communities that are part of modern, industrialized societies. In the least 
industrialized communities, especially among the !Kung Bushmen and those 
in rural Clifden, Ireland, questions about age make little sense to respondents 
(Keith et al 1994), 

This example speaks to the fact that researchers must be sensitive to cross- 
cultural differences in how the life course is conceptualized, the assumptions 
they make about (or even impose upon) a given culture and, in turn, the ways 
in which age and aspects of the life course are measured. 

Similarly, there is a great deal of cross-national variation in the degree to 
which age rules and age preferences are embedded within laws and social poli- 
cies, as well as in the organization of social institutions that are connected 
to the state (Cain 1976, Eglit 1985, Neugarten 1982). For example, in many 
countries rights and responsibilities are explicitly structured by chronological 
age. Examples of these rignts and duties include age regulations around voting, 
driving, drinking, working (especially with regard to child labor), marrying, 
compulsory school, or seeking public offices. Eligibility for pensions, social 
services, or social insurance are also often dictated by age. Many of the discus- 
sions about legal ages center around questions of how soon in life individuals 
should be granted adult rights and obligations, or how late in life these rights 
and obligations should be retained (Cain 1976). 

General references to age-related categories without clearly specified ages 
are also common in legislation (e.g. “minors,” “the elderly") (Cain 1976). As 
Mayer & Schoepflin (1989, p. 200) have noted, "age groups are a favorite way 
of dividing up social problems," and larger age groups are often targeted for 
service bracketing. 

These studies indicate that the state and many of its policies often rely on, 
and even create, conceptions of the life course. A critical examination of the 
state and its policies may provide further insights about the ways in which age, 
age structuring, and the life course are treated in a society. 
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AGE STRUCTURING AND NEW TIME BUDGETS FOR THELIFECOURSE Researchers 
have begun to explore the ways in which the demography of an aging society, 
particularly increases in longevity, may have changed the nature of family 
and social life (Hagestad 1988, Riley 1986, Uhlenberg 1980). For example, 
roles have been prolonged, the sequencing of roles over lifetime has become 
more varied and complex, and family relationships have the potential to be- 
come more active and intense (Riley 1986). Similacly, Hagestad (1991, p. 25) 
and others have suggested that historical change along key demographic di- 
mensions (life expectancy, mortality, fertility, nuptiality) has led to new “time 
budgets of adulthood," where the average number of years spent in various 
work and family roles has dramatically increased (Andersen 1985, Gee 1987, 
Watkins et al 1987). However, from an empirical standpoint, we still have 
much to learn about the ways in which family and social life have specifically 
been transformed by these new demographic parameters, and our measures 
of age, age structuring, and the life course must take these parameters into 
account. 


Measuring Different Types of Áge 

Research on aging and the life course hinges upon the effective measurement of 
age. As Bytheway (1990) notes, somewhere in the process of doing our research 
the questions “How old are you?" or “When where ou born?" must be asked. 
But apart from error (e.g. difficulty remembering one's age or date of birth) 
or deception (e.g. intentionally misrepresenting one's age or birthdate) on the 
part of a respondent, the measurement of chronological age seems relatively 
straightforward. Chronological age can be measured in a number of ways, 
ranging in level of specificity—it can be expressed in days, months, or years. 
When the question is "How old are you?", the information gathered is normally 
expressed in "completed years" Of course, the same basic information is 
obtained with the question “What 1s the date of your birth?", but Bytheway 
(1990) reminds us that this question is quite different from the question "How 
old are you?" First, date of birth relates to a past event, while age is instead a 
current characteristic. Second, date of birth remains the same throughout life, 
while age is, of course, always changing. And third, complete birthdate (in 
month-day-year) is more precise. If only birthyear is asked, the calculation 
of age is likely to involve some rounding error. Same researchers have even 
asked both questions, but at separate points in the data-gathering process, as 
a consistency check. Jolicoeur and colleagues (1988) even make a distinction 
between postnatal age, measured from the day of birth, and total age, measured 
from the day of conception (obtained by correcting postnatal age with average 
duration of pregnancy, 0.75 years). 
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These examples serves as important reminders that we must ask ourselves 
why age is being measured and how it will be used; answers to these ques- 
tions will dictate the best format for measuring age. Regardless of how it is 
operationalized, age 1s often the most powerful piece of information about an 
individual (Schroots & Birren 1988). 

At the same time, we know that chronological age itself is an “empty” 
variable—we rarely assume that it 1s age itself that causes a behavior; instead, 
it is whatever age presumably indexes that is thought to be important. For 
example, age is often used as a predictor of an individual’s physical and emo- 
tional maturity, of an individual's readiness to assume certain responsibilities, 
or even of the probability that an individual will experience various medical 
or social problems (Chudacoff 1989, Fry 1986). Still, researchers often give 
little thought to what exactly it is that age indexes, and they often do not spend 
enough time specifying the mechanisms through which age plays a role 1n the 
models they develop and test. In addition, researchers should think more criti- 
zally about whether the things we hope to index by using age might instead be 
measured more directly, or about the conditions under which age is a relevant 
dimension to begin with (Ward 1984). 

Similarly, tests of age differences are often tests of arbitrarily-defined age 
brackets, sometimes into very broad bands (e.g. 55—74, 85+), or into smaller 
5- or 10-year bands. Ironically, all of the care taken to measure age accurately 
is lost when these brackets are constructed. Worse still, researchers habitually 
break down their data in this way without compelling rationale. Why are a 
particular set of age brackets meaningful? And why we should expect to find 
important differences along these divisions? We must also begin to think more 
critically about the circumstances under which we should “hold age constant" 
in our analyses, rather than routinely control for age without justification. 

However, as people grow older, and as variability among similarly-aged indi- 
viduals increases over the life course (Dannefer 1987), chronological age may 
become less useful as an index. As a result, Birren and colleagues (Birren & 
Cunningham 1985, Schroots & Birren 1988) suggest that alternative, and more 
specific, measures of age must be developed, measures that are more sensitive 
to individual differences. Birren & Cunningham (1985, p. 8), for example, 
discuss three distinct kinds of age: 1. biological age, which is defined by an 
individual's "present position with respect to his [or her] potential life span. 
Thus, an individual's biological age may be younger or older than his [or her] 
chronological age"; 2. social age, which is defined by an individual's "roles 
and habits with respect to other members of the society of which he 1s a part. An 
individual may be older or younger depending on the extent to which he shows 
the age-graded behavior expected of him by his particular society or culture"; 
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and 3. psychological age, which is defined by the “behavioral capacities of 
individuals to adapt to changing demands.”! Birren and his colleagues argue 
that while psychological age is clearly tied to biological and social ages, psy- 
chological age is a larger concept and includes the “use of adaptive capacities 
of memory, learning, intelligence, skills, feelings, motivations, and emotions 
for exercising behavioral control or self-regulation.” 

Birren suggests that these three types of age are highly independent from 
one another, except, perhaps, at their boundaries. While Birren's typology is 
useful for explicating the concept(s) of age, he anc his colleagues offer little 
detailed advice on how these different types of age might be measured, except 
to say that the measurement of biological age would involve the “assessment 
of functional capacities of vial or life-limiting organ systems," while the mea- 
surement of social age would involve “such aspects as the individual’s type of 
dress, language habits, and social deference to other persons in leadership posi- 
tions" and the social institutions to which an individval belongs. (Some of what 
Birren describes as "social age" is discussed below in the section Measuring 
Age Norms and Age Expectations). To our knowlecge, most of the discussion 
about biological, social, and psychological age has b2en conducted at a concep- 
tual level, and few studies have attempted to measure and empirically validate 
these three types of age and their independence. 

In an earlier discussion, Neugarten & Hagestad (1976, p. 36) also remind us 
that chronological age is often a poor indicator of biological, social, or psycho- 
logical age. First, there are important individual differences in development. 
As such, age is only a "rough indicator" of an individual's status along biolog- 
ical, social, or psychological dimensions. Second, age is "only meaningful in 
relational terms, as in signifying that one is younger or older than someone else, 
closer or farther from birth or death, or in marking progress compared to other 
persons in one's reference group.” Finally, chronological age is “meaningless 
unless there is knowledge of the particular culture arid of the social meaning[s] 
attached to given chronological ages.” Despite these problems, Neugarten 
& Hagestad (1976) argue that chronological age is “an indispensable index.” 
Clearly, chronological age has convenient and practical uses as an administra- 
tive and normative gauge—it is an easily measured, objective, and universal 
attribute. As a result, it has become a prominent criterion for classifying and 


! Schroots & Burren (1988) also review an additional type of age functional age. Functional 
age 1s defined 1n terms of an individual's ability to adapt to his or her environment. Functional age 
may exist as a fourth type of age in its own right, and an indivicual’s capacity for functioning in 
daily life could be under-tood through measures such as biological, psychological, and social age. 
Another conceptualization 1s that functional age may exist as a sabdimension of each of the three 
types of age discussed earlier—leading to a specialized conception of biological, psychological, or 
social age defined ın terms of functional critena 
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ordering society. Similarly, Birren & Cunningham (1985, p. 12) suggest that 
"because age is such a powerful index, it will probably always be used to clas- 
sify data while we are en route to explanations using variables other than the 
mere ages of individuals.” 


Subjective Age Identification 


Research on subjective age identification examines how old a person feels, into 
which age group he or she categorizes him or herself, or how old one would like 
to be, regardless of one’s actual age. This has been a lively tradition since the 
1960s. As an earlier review by Cutler (1982) notes, subjective age is typically 
measured with a single item phrased something like: “Do you feel that you are: 
young, middle-aged, old, or very old?” (e.g. Markides & Boldt 1983). Across 
studies, of course, there are slight variations in the item stem or in the number 
and type of response categories. In many gerontological studies, subjective age 
identification is restricted to whether an individual defines himself or herself as 
“old” or “very old.” Often, this type of age is referred to as identity age. 

Along these lines, Barak & Stern (1986) further delineate four aspects of 
identity age based on Kastenbaum and associates’ (1972) "Ages of Me” instru- 
ment: Feel age (“I feel as though I am in my...”), look age (“T look as though I 
aminmy...”), do age “I do most things as though I were in my . . . ”), and inter- 
est age ("My interests are mostly those of a person in his/her ...”). For Barak & 
Stern's items, the response categories are decades cf life, starting with the 1920s 
and ending with the 1980s; Goldsmith & Heiens (1992) have recommended that 
the 1910s and 1990s decades might also be added if appropriate for the sample. 
Together, these four dimensions are labeled cognitive age (Barak & Stern 1986). 

Other popular measurements strategies move away from identification with 
a larger age group, and instead ask for a more specific age response. The 
original version of Kastenbaum and colleagues' (1972) "Ages of Me" instru- 
ment elicited specific ages. For example, another version of feel age is simply: 
“How old do you feel?" or “What age do you feel on the inside?" (Underhill 
& Cadwell 1983; also see Cremin 1992, Thompson 1992). Similarly, desired 
age 1s also easily asked: “What age would you most like to be?". 

In studies of comparative or relative age identification, respondents are nor- 
mally asked whether they feel older, the same, or younger than most other 
people their chronological age. Here, the comparison is between oneself and 
other similarly-aged people. In addition, few investigators have examined a 
version of comparative age that keeps the comparison internal; that is, whether 
an individual feels older, the same, or younger than his or her chronological 
age (e.g. Baum & Boxley 1983). 

In most research studies, these various types of subjective age are often 
then compared to an individual's actual chronological age. In doing so, even 
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more types of age may result (for examples, see Barak [1987], Barak & Gould 
[1985]). Besides chronological age, other correlates of subjective age have been 
considered, the most popular being physical health, emotional health, gender, 
race and ethnicity, marital status, social participation, socioeconomic status, 
and retirement status (see reviews by Barak & Stern 1986, Baum & Boxley 
1983, Montepare 1991, Montepare & Lachman 1989). 

All of the measures described above are self-perceived ages—they ultimately 
reference the subjective ages of an individual him- or herself. One might also 
consider other-perceived age, or the age status(es) of individuals as evaluated 
by others (e.g. Lawrence 1974). The important point here is that this body 
of research generally does not anchor age in simple chronological terms, but it 
instead attempts to ground age at a phenomenological, subjectively experienced, 
level. 


Measuring Age Norms and Age Expectations 


Elementary, ascriptive categories often take on complex social meanings (e.g. 
influencing attitudes, behaviors, language), and age is no exception. For excel- 
lent reviews, see Hagestad (1990), Hagestad & Neugarten (1985), Keith et al 
(1994), and Neugarten & Hagestad (1976). 

Ageis one of the most salient social and cultural dimensions, and this salience 
may be reflected in informal age norms and expectations that govern behavior. 
In terms of sociological theory, (age) norms are prescriptions or proscriptions 
about behavior in the form of "should" and "shoukd not’; they are supported 
by consensus; and they are enforced through various mechanisms of social 
control, particularly social sanctions—positive, to keep people “on track,” and 
negative, to bring straying individuals "back into line" (Berger 1963, Blake & 
Davis 1964, Gibbs 1965). 

Assumptions about the salience of age, and the power of age norms, underlies 
much research and writing on the life course. Yet a great deal of "conceptual 
ambiguity and theoretical uncertainty" surrounds -he terms "age norm" and 
"age-normative" in the life-course literature (Hagestad 1990, p. 160). Com- 
mon uses of these terms range from (a) statistical age norms (descriptions of 
statistical regularity in the timing of life events and transitions in the larger pop- 
ulation or for subgroups of the population), to (b) optimal age norms (collective 
notions about the “best,” "ideal," or "preferred" ages to experience various life 
transitions), to (c) prescriptive and proscriptive age norms (collective, shared 
expectations about when certain transitions “should” or "should not" occur, 
respectively). 

While a handful of studies have intended to examine the presence of informal 
age “norms,” their use and measurement of the concept “norm” has overlooked 
its core prescriptive or proscriptive essence. Most investigators have used the 
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term "norm" either in the statistical or optimal senses described above, even 
though the prescriptive/proscriptive most fully-captures the essence of nor- 
mative. Along these lines, Marini (1984) has criticized past research on age 
“norms,” arguing that researchers have not actually measured social norms but 
have instead focused on the “ideal” and “preferred” domains. Researchers who 
have conducted their work atthe "optimal" level have often assumed that notions 
about what is “best,” “ideal,” or “preferred” are synonymous with notions about 
what "should" or *ought to" be. Similarly, when demographers find regularity in 
life-course patterns at a population level, they often believe that the regularity re- 
flects, and is driven by, cultural age prescriptions or proscriptions. Hogan (1985, 
p. 70), for example, has argued that “in the United States, the appropriate ages 
for events are [not well-specified], but the statistical regularities in the timing of 
events are suggestive of underlying [informal] norms." Marini (1984), however, 
has noted that behavior that is "statistically regular" may not be “normative,” 
just as behavior that is "statistically irregular" may not be “non-normative.” 

The pioneering work of Neugarten and her colleagues at the University of 
Chicago during the late 1950s and early 1960s explored age "norms" and age 
"constraints." Two of the instruments that were used in their Kansas City Study 
of Adult Life are most relevant here: “Timetables for Men and Women" and 
the “Age Norm Checklist" The 11-item “Timetables for Men and Women" 
instrument generally asks respondents what they think the best age is for ac- 
complishing a variety of transitions (e.g. “What do you think is the best age 
for a man to marry?" “What do you think is the best age for most people to 
leave home?’). For these timetable items, respondents give either a specific age 
or a small age band. Consensus was measured as the proportion who cite an 
age (or ages) within a short age span. Depending on the breadth of responses 
given for any particular item, the age span used for calculating consensus for 
that item ranged anywhere from 2 to 15 years; their approach was to select a 
band that "produced the most accurate reflection of the consensus that existed 
in the data." 

The 48-item "Age Norm Checklist" asks whether respondents “approve of, 
feel favorable" or “disapprove of, feel unfavorable” about a variety of behav- 
iors at different ages (e.g. “A woman who wears bikini on the beach—when 
she's 45; when she's 30; when she's 18" or "A man who buys himself a red 
sports car—when he's 60; when he's 45; when he's 25"). Responses are then 
scored to “reflect the degree of refinement with which the respondent makes 
age discriminations," with higher scores indicating greater age constraint. 

One article in particular, Neugarten, Moore, & Lowe (1965), has become 
the single standard citation as the classic study of informal age norms. These 
investigators have been criticized, though. for posing most of their questions at 
the optimal level (asking about the “best” age to accomplish various transitions) 
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rather than the prescriptive level (the age one “should” or “ought to” experi- 
ence major life transitions), and for not actually examining whether social sanc- 
tions or other consequences are thought to exist for violating these optimal-age 
timetables (Marini 1984). However, they have examined whether respondents 
“approve” or “disapprove” of persons of different azes engaged in a variety of 
more micro-level, lifestyle-related behavior (as discussed above). 

Later, Passuth & Maines (1981) essentially replicated Neugarten’s approach, 
and their conference paper is often cited as a follow-up to Neugarten, Moore, 
& Lowe’s (1965) classic study. The work of Fallo-Mitchell & Ryff (1982), 
Plath & Ikeda (1975), Zepelin et al (1987), Gee (1990), Peterson (1996), and 
Settersten & Hagestad (1996a 1996b) has also used Neugarten’s items in some 
capacity, though these studies have improved upon some of the original items 
(e.g. creating items about work for women that parallel previous items asked 
only about men; or splitting single items that asked about “people” into two 
separate items about men and about women), added new items (e.g. return- 
ing home) and eliminated others, added follow-up questions, altered scoring 
strategies, or varied research and sampling designs. 

For additional information on research instruments related to age norms and 
expectations, readers might also consult an earlier review by Hagestad (1982). 
Hagestad’s review covers other seminal instruments, including Wood’s (1972) 
"Age-Appropriate Behavior" (which asks 41 questions about the age at which 
a person is old enough to do various things, or the age at which a person is 
too old to do other things) and Bultena & Wood's (1269) Normative Attitudes 
Toward the Aged Role" (which asks 8 questions about whether it is proper for 
older men, women, or couples to engage in a various activities). 

Another body of research has examined the images associated with people 
of different ages, especially images about personality traits and characteristics. 
The most prominent of these instruments were part 5f Hagestad's (1982) com- 
pilation, including Hickey & Kalish's (1968) "Perceptions of Adults" (which 
prompts young respondents to give the ages that they associate with people 
who are “mean or unkind,” “lonely,” or “busy”; people who “like children and 
young people,” or people they like to help), Neugarten & Peterson’s (1957) 
"Age Association Items” (a 34-item index that asks for the age that respondents 
associate with a variety of feelings or behaviors, such as “a man who gets the 
most pleasure from his children,” “when a woman gets most pleasure from sex," 
“the prime of life for a man"), and Cameron's (1972, 1976) "Comparison of 
Age Groups" (which asks respondents to compare young, middle-aged, and old 
adults on dimensions of masculinity and femininity. and on dimensions of fun 
and happiness). 

Along these lines, the recent work of Heckhausen and her colleagues 
(Heckhausen et al 1989, Heckhausen & Baltes 1991) is also noteworthy. With 
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the goal of examining perceptions of developmental gains and losses throughout 
adulthood, Heckhausen uses a checklist of 385 adjectives covering perceptions 
of the personality (e.g. "impulsive"), social (e.g. “friendly”’), and intellectual 
(e.g. “ready-witted’’) characteristics of people of different ages. 


Critical Life Events 


Within the larger field of research on life events and transitions, there is a 
smaller body of work that focuses on "critical" or "stressful" events. This 
tradition begins most notably with Holmes & Rahe (1967) and their Social 
Readjustment Rating Scale (also sometimes referred to as the Schedule of 
Recent Events). Holmes & Rahe asked respondents to indicate whether they 
had recently experienced any number of 43 different events, ranging from events 
as major as “death of a spouse" or “jail term,” to events as minor as a “vacation” 
or “minor violations of the law." 

Many later inventories have essentially sought to improve upon Holmes & 
Rahe’s earlier measure, normally expanding the scope of items, rewording 
existing items that are ambiguous, or altering the scaling procedures. These 
include the Life Events Inventory (Cochrane & Robertson 1973), the PERI Life 
Events Scale (Dohrenwend et al 1978), the Paykel Scale (Paykel et al 1971), 
the Life Experiences Survey (Sarason et al 1978), and the Life Events Question- 
naire (Chiriboga & Dean 1978, Chiriboga 1984). Other measures move away 
from major life events, and instead focus on everyday hassles (e.g. “concerns 
about owing money,” “home maintenance") and uplifts (e.g. “daydreaming,” 
“feeling healthy,” “laughing”) (Holahan et al 1984, Kanner et al 1981, Lewin- 
sohn et al 1985, Wagner et al 1988). Many of these inventories have been used 
primarily with non-aged, and often clinical, populations. 

To remedy the age bias of these inventories, several investigators have in- 
corporated items that are more appropriate for elderly populations (e.g. “age 
discrimination," “trouble with Social Security"). These inventories include 
the Elders Life Stress Inventory (Aldwin 1992), the Geriatric Scale of Recent 
Life Events (Kahana et al 1982), the Geriatric Social Readjustment Rating Scale 
(Amster & Krauss 1974), the Louisville Older Persons Event Scale (Murrell & 
Norris 1984), the Small Life Events Scale (Zautra et al 1986), modified versions 
of the Schedule of Recent Events (Mensh 1983, Wilson 1985), and the Pleasant 
and Unpleasant Events Schedules (Teri & Lewinsohn 1982). However, many of 
the items in these inventories weigh heavily toward declining physical and men- 
tal health (e.g. “loss of hearing or vision,” “difficulty walking,” “painful arthri- 
tis,” “move to home for the aged"). Some measures also include events that hap- 
pen to children or grandchildren (e.g. “child married,” “death of a grandchild”). 

There are several problems often associated with these inventories. First, a 
very select set of the larger universe of possible events is sampled (as one might 
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expect), but events that are more common for young adults, men, whites, and the 
middle class are over-represented. In addition, negative and undesirable events 
are over-represented, and "non-events" (events that are expected to happen, but 
never occur) are omitted. Some people are, of course, also never “‘at risk" to 
experience certain events (e.g. one cannot experienze the death of a child if one 
has not become a parent). 

Second, not all events are equally important, and the procedures used for 
weighting events varies dramatically between studies. Some investigators 
have developed weights using independent raters; others asked the respon- 
dents themselves to weight events subjectively; and still others derived weights 
empirically—using group mean ratings or regression techniques. Many have 
abandoned the complexities associated with weighting, and have opted to use 
no weights at all. 

Third, there are problems associated with the reliability of recall over long 
time periods. However, most of the measures intend to tap events that have 
occurred within a limited span of time, usually within the last year. For those 
interested in the life course, though, the issue of recall is more problematic, 
since the unit of time becomes lifetime (or at least lifetime to the present day). 

Fourth, a number of problematic assumptions run through this literature: 
(a) That life events cause psycho-social disturbance (rather than the other way 
around). (b) That the direction of change is almost always negative (in part, 
this reflects the fact that most inventories focus on negative rather than posi- 
tive events). (c) That effects are primarily additive in nature (the greater the 
number of events experienced, the greater the impact), and (4) that the impact 
of life events is usually direct and short-term. Future work might better ex- 
plore whether psycho-social states instead predispose individuals to experience 
a certain set of events; whether and under what conditions positive change 
might result from negative experiences; whether relationships of other func- 
tional forms (especially nonlinear) might also be operating; and whether and 
how the effects of events may be delayed, dissipate, or grow over time. 

Fifth, researchers might better classify events along multiple dimensions 
(e.g. major versus minor; anticipated versus unanticipated; controllable versus 
uncontrollable; typical versus atypical; desirable versus undesirable; acute ver- 
sus chronic). In addition, researchers might begin to gather information about 
the resources to which people have (or had) access as a means for coping with 
various events, especially those events deemed as "stressful." 

Two measurement instruments stand out as exemplars: The Standardized 
Life Events and Difficulties Interview (SL), developed by Kessler & Wethington 
(1992) at the University of Michigan in the United States; and its predecessor, 
the Life Events and Difficulties Schedule (LEDS), developed by Brown and 
colleagues (Brown & Harris 1978) at Bedford College in England. The LEDS 
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has been used in England, Europe, Canada, and Africa. Both schedules provide 
a very comprehensive understanding of severe life events, ongoing stresses and 
difficulties, and life turning points. The LS is a more structured version of the 
open-ended, semi-structured LES, and is more in line with conventional survey 
techniques. Both schedules, however, require intensive training of interviewers. 
The pilot version of the LS instrument is monumental (exceeding 200 pages), 
and is currently being validated against the original LEDS. Unlike many of the 
inventories cited earlier, Kessler & Wethington, and Brown and his associates, 
do not seek to develop composite measures of events, 

A related approach is to focus on the plausible links between one or more 
specific life events and a specific outcome, usually one related to well-being 
(e.g. George 1989, Krause 1991, McLanahan & Sgrensen 1985, Palmore et al 
1985). 

For other discussions of the meaning and measurement of stressful life events, 
see Cohen (1988), Inglehart (1991), and Thoits (1983). For discussions on later 
life in particular, see Chiriboga (1989), Ensel (1991), and Murrell et al (1988). 


Measuring Life Phases 


We now turn to a brief discussion of larger life phases—how individuals in a 
given society construct a “mental map of the life cycle” (Neugarten & Hagestad 
1976), dividing the span of time from birth to death into distinct categories. 

An earlier review by Hagestad (1982) reviewed a number of measurement 
strategies which, with slight modifications, remain the most dominant ap- 
proaches. Neugarten & Peterson’s (1957) “Phases of Adulthood” instrument 
from the Kansas City Study of Adult Life is representative of these measures, 
asking “What would you call the periods of life, the age periods most people 
go through? At what age does each begin, for most people? What are the im- 
portant changes from one period to the next, for most people?” For strategies 
of this sort, respondents are asked to provide their own divisions and labels. 

Another common strategy is to provide respondents with predetermined la- 
bels and ask them to generate the age parameters they associate with those 
labels. Cameron’s (1969) “Age Parameters” questions are good examples of 
this: Cameron asks for the ages at which “young adult,” “middle-aged,” “old,” 
and “aged” begin. Similarly, Drevenstedt’s (1976) “Onset of Adult Phases” 
questions ask for age parameters, but also make distinctions between men’s 
and women's lives—asking for the ages at which a male becomes a “young 
man," a “middle-aged man,” and an "old man,” and the ages at which a female 
becomes a “middle-aged woman” and an “old woman.” Some of Neugarten & 
Peterson's (1957) "Age Association Items" are in this format as well. 

The simplest strategy has been to provide respondents with the label for the 
life phase and a number of preset response categories. The National Council on 
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Aging's (1975) “Onset of Old Age" adopts this approach, asking “At what age do 
you think the average man becomes "old"? “Under 40 years, 40 to 44 years, 45 
to 49 years, [etc, in 4-year brackets] 90 years.or older" Respondents were also 
given the options of “Never,” “It Depends,” “When he stops working,” “When 
his health fails,” “Other,” or “Unsure.” A follow-up question asks respondents 
to give their reasons for choosing a particular age band. In addition, a similar 
series of questions were asked about women. 

Anthropologists associated with Project A.G.E. developed an innovative 
strategy for examining perceptions of the life course (Keith et al 1994): the 
“Age Game.” Respondents are given a deck of cards that contain descriptions 
of various personae (e.g. "A male, high school graduate, single, is working, liv- 
ing with parents” or “A female, working, married, two children both of whom 
have recently married” or “A male, widowed, retired, living with adult chil- 
dren”), and are asked to sort the deck based on guesses about the ages of the 
people described on the cards. After respondents have finished the sorting pro- 
cess, they are asked: “What name would you give to the age-bracket or the 
age-group of the people you have placed in this pile? Are there any other words 
you might use to describe this general age group? Roughly, what is the age or 
range of ages in terms of years of the people described in this pile?" 

Theorists and researchers have also relied on life stages or developmental 
phases as a favorite way of conceptualizing human lives from birth to death 
(e.g. Eisenstadt 1956, Erikson 1980). The differentiation of life phases has been 
seen as an historically emergent property of modern societies, beginning with 
“childhood,” “adulthood,” “old age" (Borscheid 1992, Ehmer 1990). Childhood 
was eventually differentiated into “early childhood,” “youth and adolescence,” 
and “post-adolescence.” Adulthood was also segmented into “early adulthood,” 
“mid-life,” and “old age.” Old age was split into the “‘young-old” and the “old- 
old" (Neugarten 1974), or the "third and fourth ages” (Laslett 1991). Note, 
however, that Neugarten’s original formulation of young-old and old-old was 
meant to distinguish that proportion of the elderly po»ulation who “are relatively 
healthy, relatively affluent, relatively free from traditional responsibilities of 
work and family, and who are increasingly well-educated and politically active" 
from those who are not-—regardless of their ages (Neugarten 1974, p. 187). 
Over time, gerontological researchers have increasingly defined the young-old 
and old-old in purely chronological terms—with the young-old defined as those 
between 65 and 74, or between 65 and 84; and with the old-old defined as either 
those aged 75 or older, or aged 85 or older. 

Another approach has been to divide the life course into a series of positions 
vis-à-vis the welfare state: as non-recipients and non-contributors (e.g. chil- 
dren, housewives), contributors (e.g. the economically active), and recipients 
(e.g. elderly, unemployed, disabled, maternity leave) (Mayer & Schoepflin 
1989). 
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Researchers who have concentrated on the economic participation of males, 
and the functional requirements of capitalist societies, have described the “‘tri- 
angularization of life" (Smelser & Halpern 1978) and the "tripartition of the 
life course" (Kohli 1994). That is, the life course is composed of three distinct 
segments, an early part devoted to education and training, a middle part devoted 
to work, and a final part devoted to retirement. 

As a result, education- and work-related activities may mark the transition 
from one life phase to another. "Old age,” for example, is often defined by 
eligibility rules for the receipt of pensions or Social Security benefits, or by 
the legal or actual retirement ages. However, as a measure of “old age,” age 
at retirement entails a number of obvious problems. First, many older persons 
remain employed in some capacity after retirement, while others do not have 
gainful work from which to retire. Second, the receipt of pension payments 
and social security benefits may not coincide with age at retirement. Third, age 
at retirement tells us little about an individual's level of physical and mental 
functioning, degree of autonomy or dependency, or level of social participa- 
tion. The distinction between the “young-old” and the “old-old” (cited above) 
has been prompted by the facts that: (a) average age at retirement has fallen 
markedly during the last decade, and is now between the ages of 55 and 60 in 
most advanced countries (Kohli 1994), (b) the upper end of the life span has 
extended, and (c) there is an enormous degree of variability in the functional 
ability of older persons at any given age (Maddox 1987). 

There are now several good examples of strategies for measuring the multi- 
dimensional process of aging using interview or survey methods (see Berkman 
et al 1993, from the MacArthur Foundation Research Network on Successful 
Aging; Baltes et al 1993, from the Berlin Aging Study). Nonetheless, we still 
do not have well-developed measures of the transition to dependency in old age. 
While measures related to Activities of Daily Living (ADL, IADL) capture de- 
clining abilities for self-care, these measures do not take support structures and 
social settings into account. In this area, measurement might be improved by 
adopting the logic of life course analysis (discussed later). For example, one 
might conceptualize the transition to dependency as a series of different states 
(e.g. first being unable to work, followed by the receipt of informal help, the 
receipt of formal help in the home, and institutionalization). The duration of 
time spent in these various states could then be used to better understand the 
course of this process. 


Life Review and Life History 


The continuity of subjective biographical identity, and of personality character- 
istics and behavioral dispositions, provides a sense of integrity 1n life. While 
individuals may engage in reminiscence and life review activities throughout 
the life course, these activities seem especially salient during middle and old 
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age, functioning to bridge earlier and later segments of life (Erikson 1980, 
Levinson 1986, Neugarten 1968, Staudinger 1989%.* In addition to their re- 
search value, reminiscence and life review techniques are also popular ther- 
apeutic tools (e.g. Butler & Lewis 1982, Coleman 1986, deVries et al 1990, 
Haight 1991). Life review and reminiscence activites often help individuals 
achieve a sense of self-worth, coherence, and reconciliation with one’s past; 
maintain perceptions of competence and continuity; and pass on one’s cultural 
heritage and personal legacy (Wong & Watt 1991). 

The most common approaches to life review and reminiscence involve un- 
structured or semi-structured interviews. Interviews are usually conducted 
with individuals, but family and group sessions may be conducted as well 
(e.g. Botella & Guillem 1993, Birren & Deutchman 1991, deVries et al 1990). 
Examples of key questions include ^What do you see as important or significant 
in your life?" (Lewis 1971), "What family members have had a major impact 
in shaping your life? Why?" (deVries et al 1990), "Describe three memories 
that you've been thinking about lately" (David 1990), “What were the most 
important events? What were the most difficult decisions?” (Tismer 1971), 
“Tell something of your past that is most important to you—that is something 
that has had the most influence on your life" (Wong & Watt 1991). Others have 
had revealing discussions centered around cherished objects (e.g. photographs, 
jewelry, books, paintings) (Sherman 1991). 

Other investigators have not examined the content of reminiscence, but have 
instead asked about their frequency or affective quelity. For these purposes, 
Havighurst & Glasser's (1972) questions are commonly used: "Looking back 
over the last several weeks, would you say you have done a great deal, some, 
or very little reminiscing?’ and “Would you most often characterize your 
reminiscence activities as most pleasant, pleasant, or unpleasant?” 

Another interesting approach to life review is to ask a series of questions 
about life revision; that is, to ask respondents to identify what they would do 
differently if they had their lives to live over again. For example, DeGenova 
(1992) created a life revision index based on 35 questions with the stem “Tf you 
had your life to live over again, how much time would you spend ...’, with the 
response categories being “much more, more, same, less, much less.” The items 
in DeGenova's index cover 7 different life domains: family (e.g. "developing 
close relations with your children"), work (e.g. "worrying about your job”), 


Staudinger (1989) notes that while the terms “life review" and “reminiscence” are often used 
synonymously, they should be recognized as distinct Reminiscence, Staudinger argues, is the 
simple recalling of life events, something that can be “triggered quite unwillingly by the reminiscer”’; 
life review, on the other hand, involves the interpretation and evaluation of those events and often 
requires more “active engagement of the person who is reviewing his or her life" (p. 71) Life 
review, as Butler (1963, p. 67) reminds us, "15 not synonymous wich, but includes, reminiscence.” 
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friendships (e.g. “keeping up with good friends"), health (e.g. “taking good 
physical care of your body"), education (e.g. “pursuing your education"), leisure 
(e.g. "traveling", and religion (e.g. “developing your spirituality"). Of course, 
additional depth could be gained through a series of follow-up questions on 
what exactly respondents would do differently, and why they would allot their 
time in new ways. 

A related approach is to ask subjects to graph their lives. For example, 
Runyan (1980) and several earlier studies (Clausen 1972, Pressey & Kuhlen 
1957) gave respondents a graph with ages 5 through 40 calibrated on the hor- 
izontal axis, and personal morale (spanning from 0, or “rock bottom," to 9, 
“absolute tops") on the vertical axis. Respondents were asked to draw a curve 
indicating their degree of happiness in over those years, and were then asked 
to discuss the reasons for their ratings. Similarly, others have administered 
blank-grid life graphs with only 5-year age demarcations (from 0 through 80) 
on the horizontal axis, and have asked subjects the following question: “Can 
you visualize how your life could be put into a graph? That is, ups and downs, 
level periods, rises and declines, etc. Assuming that you live until at least 80 
vears of age, how do you think that a graph of your life will look?” (e.g. Back 
& Bourque 1970, Bourque & Back 1977, Back & Averett 1985). Depending on 
the age of the subject and the age span of the graph, some subjects respond in 
terms of how they think their lives will unfold (prospectively), others in terms 
of how their lives actually did unfold (retrospectively), and still others provide 
a mix of both. While these graphs may be used with any age group, this method 
may be especially useful in research with middle-aged and aged respondents, 
particularly as a way to structure life review discussions or to gather life history 
data. 

Readers interested tn other applications of reminiscence and life review meth- 
ods might consult two recent edited volumes by Haight & Webster (1995) and 
Hendricks (1995). Life history methods, which are related to life review meth- 
ods, are a primary means for collecting data on the life course. These methods 
are considered in detail in the next section. 


MEASURING THE LIFE COURSE 


As discussed in the introduction, the life course refers to the “social processes 
extending over the individual life span or over significant portions of it, espe- 
cially [with regard to] the family cycle, educational and training histories, and 
employment and occupational careers. The life course is shaped by, among 
other things, cultural beliefs about the individual biography, institutionalized 
sequences of roles and positions, legal age restrictions, and the decisions of 
individual actors” (Mayer & Tuma 1990, p. 3). 
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Life trajectories, transitions, and events are cent-al concepts in the study of 
the life course (Elder 1985). The concept of the life trajectory, similar to that of 
a career, is long in scope, and refers to “a pathway defined by the aging process 
or by movement across the age structure" (Elder 1985, p. 31). The individual 
life course is composed of multiple, interdependent trajectories (for example, 
Work, family, and educational trajectories). 

Trajectories are punctuated by a sequence of successive life events and tran- 
sitions. Events and transitions are brief in scope, and refer to changes in state. 
However, an event is usually conceptualized as a reletively abrupt change, while 
a transition is usually conceptualized as a more gradual change. Transitions 
and events are always placed within a larger trajectory, and the trajectory gives 
them “distinctive form and meaning” (Elder 1985, p. 31). 

The concepts of timing, sequencing, duration, end spacing are often used 
to describe life events, transitions, and the characteristics of larger trajectories 
(Hagestad & Neugarten 1985). Timing refers to the age at which given transi- 
tions occur in the life course. Sequencing refers to the order in which transitions 
are experienced over the life course. Duration refers to the length of time spent 
in any give state. Spacing refers to the amount of time between two or more 
transitions, the pace at which multiple transitions are experienced. 

In addition to these basic concepts, the researcher must identify the life do- 
mains and state spaces to be measured. A natural and convenient way to do this 
is to structure data collection on activities in the domains of family (e.g. parents, 
siblings, spouses or partners, children), education and training (e.g. schools, 
training places, credentials), employment (e.g. emp.oyment contracts, occupa- 
tional activities, firm membership, sector location), and residence and house- 
hold (e.g. places of residence, household composition). 

While these dimensions describe the primary activities across life, a more 
complete picture of the life course must also include more marginal periods 
and events— such as brief periods of training, second or part-time jobs, periods 
of unemployment or sickness (for examples, see Mayer & Brückner 1989). 


Calendars, Life History Matrices, and Event Histories 

Life history matrices plotlife domains, and specific events and transitions within 
them, over time (Balan et al 1969, Freedman et al 1988, Frank & van der Burgh 
1986; readers interested in working with archival data might consult a recent 
monograph by Elder et al 1994). In life history matrices, the first column cali- 
brates time, and the remaining columns specify domains and events of interest. 
The date an event began is indicated in the appropriate row, and its duration is 
then charted vertically. The advantage of this type of instrument 1s that activ- 
ities in multiple life domains are simultaneously mapped onto one frame, and 
the relationships between them are easily seen. In addition, inconsistencies are 
easily detected and can be double-checked immediately with the respondent. 
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Many of these instruments are also easily self-administered. The great disad- 
vantage of these measures is that little information can be noted in matrix cells 
(e.g. it is difficult to note detailed information about occupational titles). As 
a result, only a very crude and, in most cases, insufficient picture of the life 
course can be constructed using life history matrices. 

A more adequate means for collecting data on life histories is exemplified 
by the Life History Questionnaire applied in six surveys of the German Life 
History Study (Mayer & Brückner 1989), which built upon the Norwegian Life 
History Study (Rogoff-Ramsoy 1973). In developing this instrument, the goal 
was not only to extract monthly data for several life domains, but also to obtain 
a wide variety of contextual information on each state. For these purposes, 
complete event histories (from birth to the present) were collected in each life 
domain, beginning with the residential history. In the case of residential history, 
for example, data are gathered on every residential "state," including the month 
and year that mark the beginning and end points, the size and type of residence, 
geographic location, and the composition of household. The residential history 
is taken first because it provides a good anchor for the recollection of informa- 
tion on other trajectories. In a similar format, general schooling, occupational 
training, primary employment, secondary employment, and military service 
trajectories are charted and then reviewed for all periods that are unaccounted. 
Thus, the main content of these surveys becomes the life history itself (for 
theories, rationale, and applications, see Blossfeld et al 1991, Mayer & Tuma 
1990). 

In the German Life History Study, this technique was applied in both personal 
and computer-assisted telephone interviews with persons between the ages of 
29 and 103 (Brückner 1994). It results in rich and accurate time-continuous 
data. Of course, there are high costs associated with interviewing time (in most 
cases, between 1.5 and 3 hours), the editing process (including return calls 
to respondents to clarify inconsistencies), and the training of interviewers and 
editing staff. When respondents agreed, their personal and telephone interviews 
were also audiotaped; those recordings were then used during the process of 
editing data. Shorter versions of this instrument are possible when either the 
number of life domains and/or the range of contextual variables are restricted. 


Retrospective Versus Prospective Measurement Strategies 


Although retrospective methods are often used in cross-sectional studies, the 
data itself, at least in principle, may approach that obtained by a sufficiently 
long panel study (that is, respondents are interviewed only once, but information 
is gathered on the entire life course). One-time retrospective surveys offer an 
excellent alternative to longitudinal designs, especially for those interested in 
the life course as a whole (Featherman 1980). The most common problem 
associated with retrospective data concerns the accuracy and precision of recall 
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(Dex 1991). This criticism, however, applies to any kind of autobiographical 
data, regardless of research design. 

When retrospective data are cross-checked against data from archival sources 
(e.g. registry, school, or firm records) some degree o? inconsistency is likely to 
emerge. This incongruence 1s often taken as evidence that retrospective data 
are not completely reliable, even though archival data may be equally fallible. 
However, once common sources of error are known, these errors can be taken 
into account when research instruments are being designed; as a result, the 
amount of measurement error may be substantially reduced (Brückner 1990). 
An advantage of most life history instruments is that they address complete 
histories 1n a systematic fashion, embedding events and transitions in both 
diachronic and synchronic life contexts. 

Data on the life course can also be collected prospectively (e.g. in panel stud- 
ies with multiple measurement points). For example, zhe Panel Study of Income 
Dynamics (PSID) in the United States, the first major household panel, became 
the model for similar studies in Germany, the Netherlands, Great Britain and 
Sweden. However, even panel designs are not exempt from problems associ- 
ated with retrospective measurement (Featherman 1980), since information is 
usually gathered on the penod of time before the survey began, and on the pe- 
riod of time that elapses between measurement points. The major disadvantage 
of panel studies is, of course, that one must wait many years until a sufficient 
span of the life course can be described. 

Readers interested in further discussion of issues related to data quality in 
longitudinal research might consult a recent edited volume by Magnusson & 
Bergman (1990). 


Data Organization and Analyses 


Standardized, quantitative life histories from large samples have been collected 
since the late 1960s (Balan et al 1969, Blum et al 1969, Rogoff-Ramsoy 1973). 
But it was not until the mid- 1970s and early 1980s that adequate methods for 
data organization, retrieval, and analysis became available (e.g. Allison 1984, 
Blossfeld et al 1989, Courgeau & Lelievre 1992, Flinn & Heckman 1982, Griffin 
1993, Mayer & Tuma 1990, Singer & Willett 1991, Yamaguchi 1991). 

The recent sociology of the life course has almost accomplished its mission 
to show that lives are differentiated, event-related, and episodic, and must be 
examined and analyzed in a step-by-step piecemeal fashion. Descriptive tech- 
niques for survival data and stochastic models for discrete events in continuous 
time (known as hazard rate or event history models? have been shown to be 
especially useful in analyzing life history data. A major charge of critics has 
been that these methods atomize lives into unrelated transitions and episodes. 
In contrast, these critics have called for methods to better represent total trajec- 
tories with typologies (Abbott 1983, Abbott & Forrest 1986, Abbott & Hrycak 


AGE AND THE LIFE COURSE 253 


1990, Chan 1995). One technique in particular-——“optimal matching," which 
seeks to unravel sequence patterns—has been proposed, but it remains to be 
seen whether it holds its promise. 

In response, proponents of event-history methods have insisted that the po- 
sition of their critics is doubly misleading: They argue that both trajectories 
and the richness of individual biographies can, in fact, be modeled well with 
event-history procedures. In addition, they suggest that typological approaches, 
especially those that seek to represent sequences, make inadequate use of the 
temporal character of the data, and are insufficient for covering the entire range 
of possible pathways in most empirical applications (Mayer 1986, Mayer & 
Tuma 1990). 

Readers interested in learning more about various techniques for analyzing 
longitudinal data, including the analysis of sequences, might consult a recent 
casebook of applications edited by Magnusson and colleagues (1991). 


CLOSING STATEMENT 


We have argued that the measurement of age, age structuring, and the life course 
have become more problematic as the study of human lives has moved toward 
more detailed analyses and explanations. Our scientific treatment of the life 
course must allow for the heterogeneity, discontinuity, and contingency that 
exists in present-day societies. Earlier notions of the life cycle, life span, or 
life course, were at least in principle based on holistic conceptions of human 
lives. However, as the scientific treatment of human lives became more elabo- 
rate, our approaches became increasingly fragmented not only across academic 
disciplines, but also within them. Yet life-course research must address many 
different analytic levels if it is to be comprehensive. The challenge now lies 
in moving away from this fragmentation to a truly integrative study of the life 
course. Herein lies our paradox: As we seek to better understand the course of 
human lives in contemporary and ever-changing societies, the effective empir- 
ical measurement of its key concepts simultaneously becomes more pressing 
and more complicated. 


Visit the Annual Reviews home page at 
bttp:/Arww.annurev.org. 
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ABSTRACT 

Recent work in cognitive psychology and social cognition bears heavily on con- 
cerns of sociologists of culture Cognitive research confirms views of culture as 
fragmented; clarifies the roles of institutions and agency; and illuminates supra- 
individual aspects of culture. Individuals experience culture as disparate bits of 
information and as schematic structures that organize that information, Culture 
carried by institutions, networks, and social movements diffuses, activates, and 
selects among available schemata. Implications for the study of identity, collective 
memory, social classification, and logics of action are developed. 


INTRODUCTION 


The study of culture in everyday life remains a virtuoso affair. Interpretive 
studies offer great insight but fail to build on one another. Cultural theory has 
become highly sophisticated but not fully operational. These riches ready the 
field for takeoff, like the study of social stratification in Sorokin’s day (1957 
[1927]). But before the study of lived culture can become a cumulative enter- 
prise, scholars must clarify the cognitive presuppositions behind their theories 
of what culture does and what people do with it, and the fundamental concepts 
and units of analysis (Jepperson & Swidler 1994, Wuthnow 1987). 

Recent work in cognitive psychology and social cognition provides resources 
for both tasks. After describing recent convergence between cultural sociol- 
ogy and psychology, this chapter considers lessons of recent work on cognition 
for presuppositions about the nature of culture; develops implications of these 
lessons for sociological work on identity, collective memory, social classifi- 
cation, logics of action, and framing; and points to key problems that remain 
unsolved. 
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Rather than offer an exhaustive review of cognitve sociology per se (see 
Zerubavel 1997) or work in psychology relevant tc culture (see D'Andrade 
1995), I emphasize tensions and affinities between recent cognitive research 
and work in the sociology of culture with the aim of bringing the former into the 
service of the latter. I focus on how people use culture, rather than the production 
of culture, ideology, or culture embedded in the physical environment. The 
point is not to psychologize the study of culture, but to lay a foundation for a 
view of culture as working through the interaction of saared cognitive structures 
and supra-individual cultural phenomena (material culture, media messages, or 
conversation, for example) that activate those structuces to varying degrees. 


SOCIOLOGY AND PSYCHOLOGY: POINTS 
OF CONVERGENCE 


A handful of sociologists have appreciated the potential of cognitive science 
to inform sociological work on culture (Carley 1989, Cicourel 1973, Schwartz 
1981, White 1992), and some social constructionists have anticipated impor- 
tant results of cognitive research (Berger & Luckman 1967, Garfinkel 1987 
[1967], Zerubavel 1991), For the most part, however, sociologists of culture 
have ignored relevant work by cognitive psychologists, social psychologists, 
and public-opinion researchers. This omission reflects a mismatch between 
the modal intellectual styles of humanistic, interpretively oriented cultural so- 
ciologists and experimentally oriented positivistic psychologists, as well as the 
disappointing legacy of Parsons’ efforts at disciplinary fusion, which psychol- 
ogized culture, reducing it to shared values, norms, and attitudes. 


Sociology: More Complex Views of Culture 


In recent years, however, common ground between sociology of culture and 
psychology has grown. The major development within sociology has been 
a shift to a more complex understanding.of culture. Thirty years ago, most 
sociologists viewed culture as a “seamless web" (Swidler 1997), unitary and 
internally coherent across groups and situations. In effect, culture was por- 
trayed as a latent variable influencing in common suck manifestations as media 
images, responses to attitude questionnaires, and the values embodied in ev- 
eryday practices. Individuals were presumed to acquire culture in the course of 
socialization and, in the popular oversocialized view (Wrong 1961), to enact it 
unproblematically. It followed from this perspective that there was little reason 
to worry about constructs used to study culture, for amy kind of "cultural stuff" 
could serve as an indicator of the underlying latent variable. 

By contrast, recent work depicts culture as fragmented across groups and 
inconsistent across its manifestations (Martin 1992). The view of culture as 
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values that suffuse other aspects of belief, intention, and collective life has 
succumbed to one of culture as complex rule-like structures that constitute 
resources that can be put to strategic use (Bourdieu 1990, Sewell 1992, Swidler 
1986). 

This shift makes studying culture much more complicated. Once we ac- 
knowledge that culture is inconsistent—that people’s norms may deviate from 
what the media represent as normal, or that our preconscious images and dis- 
cursive accounts of a phenomenon may differ—it becomes crucial to identify 
units of cultural analysis and to focus attention upon the relations among them. 
In effect, our measures stop being indicators of a latent variable (culture), and 
their relationship to culture becomes analogous to that of education, income, 
and place of residence to social stratification: separate phenomena, analytically 
related to acommon theoretical construct, the relations among them a matter for 
empirical investigation (D' Andrade 1995 notes similar trends in anthropology). 

Similarly, once we acknowledge that people behave as if they use culture 
strategically, it follows that the cultures into which people are socialized leave 
much opportunity for choice and variation. Thus our attention turns to ways in 
which differing cultural frames or understandings may be situationally cued. 
Addressing such issues requires more elaborate and contestable psychological 
presuppositions than did the culture-as-latent-variable view. 


Psychology: More Complex Views of Cognition 


Such questions make it sensible for sociologists of culture to turn to psychology 
for insight into the mechanisms through which shared culture enters into cog- 
nition. Yet nothing guarantees that psychologists, who have their own research 
agendas, can help us. Thirty years ago, behaviorism made psychology essen- 
tially irrelevant to the study of culture. Twenty years ago, psychologists casting 
off the yoke of behaviorism focused primarily on the acquisition of skills and 
capacities of little interest to most sociologists of culture. Even a dozen years 
ago, the implications for cultural sociology of many of the ideas and research 
traditions that are most useful today were still unclear. 

What has happened to make psychology useful to sociologists of culture? 
First, psychologists have rejected behaviorism, accepted and demonstrated the 
existence of mental structures used to perceive, process, and retrieve informa- 
tion, and found ways to make inferences about such structures. Second, just 
as sociological research has demonstrated culture’s complexity and fragmenta- 
tion, psychological research has demonstrated the complexity of memory and 
provided glimpses of the partitioning of mental structures by domain. Third, 
recent foci of psychological research (schemata, categories, mental models, 
and so on) are much richer in cultural content than the formal operations or in- 
tellectual capacities that once preoccupied cognitivists and developmentalists 
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(Rogoff & Chavajay 1995). Fourth, some psychologists have taken notice of 
such sociological topics as cross-cultural differences in cognition (Shweder & 
Bourne 1991, Markus & Kitayama 1991), elite/popular interaction in cultural 
change (Moscovici 1984), and “distributed cognition” (i.e. the social division 
of cognitive labor) (Resnick et al 1991, Salomon 1993). 

In addition to expanding the grounds of shared interest between the two dis- 
ciplines, such developments have also softened two important epistemological 
differences. Whereas most sociologists of culture have been steadfastly an- 
tireductionist, resisting efforts to portray culture as the aggregate of individual 
subjectivities, psychology has focused upon the individual. Increasingly, how- 
ever, as I shall argue, psychological research bolsters and clarifies the view of 
culture as supra-individual, and even addresses supra-individual aspects of cog- 
nition directly [as in work on pluralistic ignorance (Miller & Prentice 1994)]. 

Second, some sociologists of culture rejected the subjectivist focus of psy- 
chological research, calling instead for research on external aspects of culture 
amenable to direct measurement (Wuthnow 1987). In recent years, cognitivists 
have developed ingenious empirical techniques (reviewed in D’ Andrade 1995) 
that permit strong inferences about mental structures, going far toward closing 
the observability gap between external and subjective aspects of culture. 

Of course, the fit between the disciplines must aot be exaggerated. Most 
of what psychologists do is irrelevant to sociologists of culture, and much 
of the culture sociologists’ study is supra-individua.. Common ground has in- 
creased but will remain limited by the different subject matters of the disciplines 
(Zerubavel 1997), which will remain complements -ather than substitutes. 


COGNITIVE PRESUPPOSITIONS 
OF CULTURAL SOCIOLOGY 


Sociologists who write about the ways that culture enters into everyday life 
necessarily make assumptions about cognitive processes. If we assume that a 
shared symbol evokes a sense of common identity (V/arner 1959), that a certain 
frame provokes people to think about a social issue in a new way (Gamson 
1992), that lessons about the structure of space and time learned in school are 
generalized to the workplace (Willis 1977), or that surveys can measure class 
consciousness (see Fantasia’s critique 1995), we are then making powerful cog- 
nitive assumptions. Such assumptions, while metatheoretical to sociologists, 
are keenly empirical from the standpoint of cognitive psychology. It 1s cru- 
cial, then, to evaluate our assumptions (or adjudicate differences among them) 
by microtranslating presuppositions (Collins 1981) to the cognitive level and 
assessing their consistency with results of empirical research on cognition. 
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Coherence vs Fragmentation 


Many sociologists have come to reject the latent-variable view of culture as 
coherent, integrated, and ambiguous in favor of representations of culture as a 
“toolkit” (Swidler 1986) or “repertoire” (Tilly 1992): a collection of stuff that 
is heterogeneous in content and function. Yet much empirical work on culture 
still presumes that culture is organized around national societies or cohesive 
subnational groupings, is highly thematized, and is manifested in similar ways 
across many domains (Hofstede 1980, Bourdieu 1984). 

Is culture a latent variable—a tight network of a few abstract central themes 
and their more concrete entailments, all instantiated to various degrees in a 
range of symbols, rituals, and practices? If so, then we would expect to find 
that group members share a limited number of consistent elements—beliefs, 
attitudes, typifications, strategies—and that the inclusion of any one element in 
the collective culture implies the exclusion of inconsistent elements. 

Or is culture a grab-bag of odds and ends: a pastiche of mediated repre- 
sentations, a repertoire of techniques, or a toolkit of strategies? If so, then 
we might expect less clustering of cultural elements within social groups, less 
strong linkages among the elements, and weaker pressures for the exclusion of 
inconsistent elements. 

Research in cognitive psychology strongly supports the toolkit over the latent- 
variable view and suggests that the typical toolkit is very large indeed. Partic- 
ularly relevant here is research (summarized by Gilbert 1991) on how people 
attribute accuracy or plausibility to statements of fact and opinion. Consistent 
with Swidler's (1986) contention that "all people know more culture than they 
use," Gilbert reports that “The acceptance of an idea is a part of the automatic 
comprehension of that idea, and the rejection of the idea occurs subsequent to 
and more effortfully than its acceptance." In other words, our heads are full of 
images, opinions, and information, untagged as to truth value, to which we are 
inclined to attribute accuracy and plausibility. 

Research on memory tells a similar story, revealing that information (includ- 
ing false information) passes into memory without being “tagged” as to source 
or credibility, and that active inference is required to identify the source of the 
information when it is recalled. Such inferences may be incorrect, yielding 
misattributions of source and credibility (Johnson et al 1991). 

This work has several important implications for students of culture. First, 
it refutes the notion that people acquire a culture by imbibing it (and no other) 
through socialization. Instead, it directs the search for sources of stability and 
consistency in our beliefs and representations, first, to schematic organization, 
which makes some ideas or images more accessible than others; and, second, 
to cues embedded in the physical and social environment. 
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Second, learning that people retain (and store with a default value of “cor- 
rect") almost every image or idea with which they have come into contact, 
renders intelligible otherwise anomalous research findings about inconsistency 
in expressions of attitudes across time, cultural volatility 1n periods of rapid 
change (e.g. the fall of the Soviet system), and the scsceptibility of attitudes to 
framing effects (Sniderman & Piazza 1993). 

Third, the research explains the capacity of individuals to participate in multi- 
ple cultural traditions, even when those traditions contain inconsistent elements. 
Fourth, it establishes the capacity of people to maintain distinctive and incon- 
sistent action frames, which can be invoked in response to particular contextual 
cues. Fifth, this work raises the possibility that socialization may be less ex- 
perientially based, and more dependent upon media images and hearsay, than 
many of our theories (for example, Bourdieu's habitus [1990] construct) imply. 

Such inferences as these go beyond the scope of cognitive studies, to be sure, 
and much rides on the precise ways in which schematic organization imposes 
order upon stored knowledge and memory. Nonetheless, recent cognitive re- 
search strongly reinforces the “toolkit” as opposed to the “‘latent-variable” view 
of culture and, at the very least, places the burden cf proof on those who de- 
pict culture as strongly constraining behavior or who would argue that people 
experience culture as highly integrated, that cultural meanings are strongly the- 
matized, that culture is binding, and that cultural infcrmation acquired through 
experience is more powerful than that acquired through other means. 


Institution and Agency 


Cognitive research can also enhance our appreciation of the view that culture 
both constrains and enables (Sewell 1992). Although this position has become 
virtually catechismic among sociologists of culture, we know little about the 
conditions under which one or the other is the case. Many sociologists believe, 
following Gramsci (1990), that culture, embedded in language and everyday 
practices, constrains people's capacity to imagine alternatives to existing ar- 
rangements. At the same time, we know that people act as if they use cultural 
elements strategically to pursue valued ends (Bourcieu 1990). Cognitive re- 
search cannot answer the essentially sociological question of when culture does 
each, but it can provide direction to the search. 

The finding that culture 1s stored in memory as an ir.discriminately assembled 
and relatively unorganized collection of odds and er.ds imposes a far stronger 
organizing burden on actors than did the earlier oversocialized view. The ques- 
tion, then, is how the actor organizes the information that she or he possesses 
Psychological research points to two quite different mechanisms or modes of 
cognition. 
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AUTOMATIC COGNITION The first, and most important, which I refer to as au- 
tomatic cognition is “implicit, unverbalized, rapid, and automatic” (D’ Andrade 
1995). This routine, everyday cognition relies heavily and uncnitically upon 
culturally available schemata—knowledge structures that represent objects or 
events and provide default assumptions about their characteristics, relation- 
ships, and entailments under conditions of incomplete information. 

Psychological research on schemata is central to the interests of sociologists 
both methodologically (due to advances in techniques that reveal taken-for- 
granted assumptions to which subjects may not have easy verbal access) and 
substantively, for what it tells us about how culture works. Indeed, for some 
purposes, it may be useful to treat the schema as a basic unit of analysis for 
the study of culture, and to focus on social patterns of schema acquisition, 
diffusion, and modification (Carley 1991 makes a related argument). 

Schemata are both representations of knowledge and information-processing 
mechanisms. As representations, they entail images of objects and the relations 
among them. Psychologists use the term broadly [some would suggest too 
broadly (Fiske & Linville 1980)]. It can refer to simple, highly abstract con- 
cepts [for example, container (D’ Andrade 1995)]; to concrete activities (buying 
chewing gum), or to complex social phenomena (group stereotypes or social 
roles). Event schemata or scripts (Abelson 1981, Garfinkel 1987) constitute 
an important class of schemata. Special attention has also been given to self 
schemata (Milburn 1987, Markus & Kitayama 1994, Markus et al 1997), cultur- 
ally variable representations of the self that provide stability both to individual 
behavior across time and to social interactions within the group. 

Schemata are also mechanisms that simplify cognition. Highly schematic 
cognition is the realm of institutionalized culture, of typification, of the habitus, 
of the cognitive shortcuts that promote efficiency at the expense of synoptic 
accuracy (Berger & Luckman 1967, Bourdieu 1990, Kahneman et al 1982). 
Much cognitive research demonstrates that “schematic material dominates other 
material in accurate recall, in intruded recall, in recognition confidence, in recall 
clustering and in resistance to disconfirmation.... Schemata also facilitate 
inaccurate recall when the information is schema consistent” (Fiske & Linville 
1980: 545). In schematic cognition we find the mechanisms by which culture 
shapes and biases thought. 


People are more likely to perceive information that is germane to existing 
schemata Von Hippel et al (1993) report that experimental subjects are more 
likely to perceive correctly terms that are schematically relevant than those that 
are not. Information embedded in existing schemata and information that is 
schema-dissonant are both more likely to be noticed than information orthogo- 
nal to existing structures (Schneider 1991). Such laboratory findings resonate 


270 DiMAGGIO 


with results in historical sociology and cultural studies: for example, the grad- 
ual and halting acceptance of information about the New World by early modern 
mapmakers (Zerubavel 1992); the ways in which archaic physical models con- 
strained medical scientists’ interpretation of new evidence about syphilis (Fleck 
1979); and the penchant of male biologists for seeing dominance hierarchies 
when they watch apes and elephant seals (Haraway 1991). 


People recall schematically embedded informatior. more quickly Most psy- 
chological evidence is based on laboratory experiments, which reveal that sub- 
jects remember longer lists of words, or interpret ambiguous stimuli more accu- 
rately, and retrieve information about a story they have heard more effectively 
if itis relevant to preexisting mental structures that render the information inter- 
pretable (Sedikides & Skowronski 1991). But again, there are intriguing socio- 
logical parallels in studies that report cross-cultural differences in descriptions 
of the content of the same novel (Griswold 1987), television program (Liebes 
& Katz 1990), or movie (Shively 1992) that reflect collective preoccupations 
("chronically activated mental structures" in psychological parlance). 


People recall schematically embedded information more accurately When 
Freeman et al (1987) asked members of a faculty workshop to list the people who 
had attended the previous meeting, they found that lcng-term attenders correctly 
recalled participants who regularly attended, but forgot the infrequent atten- 
ders. Using a very different method (analysis of Wa-ergate transcripts), Neisser 
(1981) reported that Nixon aide John Dean remembered schema-consistent 
events more accurately than events that were schema-inconsistent. 


People may falsely recall schematically embeddea events that did not occur 
Freeman et al's (1987) informants remembered regular attenders as present at 
the meeting in question even when they hadn't been there. When subjects are 
told to code small-group interactions and then given questionnaires about char- 
acteristics of group members shortly thereafter, the post-hoc evaluations yield 
much higher correlations of schematically related behaviors (e.g. criticizing 
or expressing hostility) than do the real-time codings (Shweder 1982). Similar 
confusion of schematic representations for real events may be observed in at 
least some reports of satanic child abuse (Hacking 1995) and in some of former 
President Reagan's speeches. 

The parallel with sociological accounts of insti-utions 1s striking. Typifi- 
cations (mental structures) influence perception, interpretation, planning, and 
action (Berger & Luckman 1967, DiMaggio & Powell 1991). Institutionalized 
structures and behaviors (1.e. those that are both highly schematic and widely 
shared) are taken for granted, reproduced in everyday action [Giddens’ “struc- 
turation" (1984)] and treated as legitimate (Meyer & Rowan 1977). Indeed, an 
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eminent psychologist (Bruner 1990:58) has written explicitly of the “schema- 
tizing power of institutions.” Thus the psychology of mental structures provides 
a microfoundation to the sociology of institutions. 

Research on social cognition enhances our understanding of how culture con- 
strains but does not support theories that depict culture as overwhelmingly con- 
straining. Instead, consistent with contemporary sociological theorizing, work 
in psychology provides microfoundational evidence for the efficacy of agency. 


DELIBERATIVE COGNITION In contrast to automatic thought, psychologists 
note a quite different form of cognition, which is “explicit, verbalized, slow, 
and deliberate" (D' Andrade 1995). When sufficiently motivated, people can 
override programmed modes of thought to think critically and reflexively. 

Such overrides are necessarily rare because deliberation is so inefficient in 
its rejection of the shortcuts that automatic cognition offers. Consequently, the 
key question is why people are ever deliberative. Psychologists have identified 
three facilitating conditions in studies that intriguingly parallel work in the 
sociology of culture. 


Attention Psychological research suggests that people shift into deliberative 
modes of thought relatively easily when their attention is attracted to a problem. 
For example, experimenters can create false recollections of a videotape or story 
among laboratory "witnesses" by presenting inaccurate information or asking 
leading questions (Loftus et al 1989). But when the task is changed to ask 
subjects to think carefully about the source of particular bits of information, 
the experimental effect is diminished or eliminated (Johnson et al 1993). In 
experimental studies of attitude-behavior consistency, merely increasing self- 
awareness by placing a mirror in the face of the subject as he or she completes an 
attitude questionnaire significantly increases the attitude-behavior correlation 
(Abelson 1981:722). Such results parallel the insights of students of social 
movements, who have studied agenda-building and who have also noted the 
effectiveness as an organizing device of reframing issues in ways that call 
attention to problems salient to movement participants (Snow & Benford 1992). 


Motivation People may also shift from automatic to deliberative cognition 
when they are strongly motivated to do so by dissatisfaction with the status 
quo or by the moral salience of a particular issue. For example, although 
racist schemata are accessible to most white Americans, whites can override 
such schemata to some extent through awareness and reflexivity (Devine 1989). 
Marx's theory of class consciousness—which contends that physically prox- 
imate workers facing immiseration will overcome false beliefs through inter- 
action and reflection—is a classic sociological counterpart (and see Bourdieu 
1974). 
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Schema failure Finally, people shift to more deliterative modes of processing 
when existing schemata fail to account adequately for new stimuli. Research 
on the psychology of intergroup relations suggests that people in task groups 
initially code others on the basis of stereotypes but shift to more deliberate evalu- 
ations when faced with very strong inconsistent evidence (Schneider 1991:536, 
Berger et al 1980). Moscovici, whose Durkheimian social psychology dif- 
fers in many respects from other psychological accounts of mental structures 
(Farr & Moscovici 1984, Augoustinos & Innes 1950), argues that collectivities 
confronted with disjunctive social change construct new social representations 
(often anchored in analogies to pre-existing schemata, and often constructed 
deliberatively by experts in the social sciences and mass media) in order to inter- 
pret new stimuli. Such arguments are paralleled in Garfinkel’s (1967) breaching 
experiments, which forcibly and painfully overrod2 automatic processing, and 
in Swidler’s contention that ideologies and other consistent cultural forms are 
more influential during unsettled times (1986, Jepperson & Swidler 1994 on 
constitutive vs. strategic culture). 

Psychologists may note that I have paid scant attention to active debates 
about the nature of mental structures and have drawn too sharp a contrast 
between automatic and deliberative processing. Research on culture, however, 
can already benefit from what research on cognition has resolved. The notion 
of schema is a fair approximation of phenomena identifiable in fuzzy outline, 
if not sharp relief, by experimental methods; research on schemata advances 
sociological understandings of culture, especially tnstitutions; and research on 
automatic vs deliberative processing may help sociologists determine what to 
do with the widely believed but theoretically inert notion that both institution 
and agency are central to social life. 


Culture as Supra-Individual 


It is no news to sociologists that culture exists, sui generis, at the collective level. 
(The position taken here—that culture is also manifest in people’s heads—is 
probably more controversial.) Nonetheless, psychological research can help us 
appreciate several aspects of culture’s supra-individual character that sociolo- 
gists of culture sometimes neglect. 


PLURALISTIC IGNORANCE A lively branch of social-psychological research 
derives from Robert K. Merton’s notion of “pluralistic ignorance” (1957): the 
idea that people act with reference to shared representations of collective opin- 
ion that are empirically inaccurate. Such research directs us to distinguish 
between two senses in which culture is supra-individual: as an aggregate of 
individuals’ beliefs or representations, or as shared representations of individu- 
als’ beliefs. Substantial evidence indicates that the latter deviates substantially 
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from the former with significant behavioral consequences and that this process 
represents a basis for the relative autonomy of social norms (Miller & Prentice 
1996, Noelle-Neumann 1993). 


INTERGROUP CONTRAST AND POLARIZATION The existence of group-level 
cultures (shared understanding partly independent of individual beliefs) is also 
suggested by the tendency of groups to adopt public positions more extreme 
than the preferences of their members, especially when acting with reference 
to a contrasting group. What is striking is not polarization per se, but the cul- 
tural availability of polarized stances (representations of collective opinion) on 
which members of each group can converge (Tajfel 1981). 


SCHEMATA AS CULTURE Not all schemata are cultural to the same degree. 
Some schemata reflect universal cognitive processes (for example, basic object 
categorization), whereas others may be quite idiosyncratic. Many schemata, 
however, and the schemata of greatest interest to sociologists of culture, en- 
act widely held scripts that appear independent of individual experience. For 
example, the research, cited above, that found coherence in ratings of small 
group behavior emerging only after the fact, led the author (Shweder 1982) to 
speculate that much of what passes as clinical research on personality is really 
about cultural constructions of personhood (and see Meyer 1986). 


COHERENT CULTURES AS EXTERNAL TO PERSONS Despite this chapter's fo- 
cus on subjective representations of culture, we must not forget that relatively 
coherent cultural forms exist independently of persons in the broader environ- 
ment. Indeed, one of the more notable characteristics of modern societies is 
the existence of a cultural division of labor in which intellectual producers in- 
tentionally create and diffuse myths, images, and idea systems (Douglas 1986, 
Farr & Moscovici 1984, Swidler 1997). Other relatively coherent representa- 
tions exist less formally as narratives or stories repeatedly invoked in public 
discourse (Dobbin 1994, White 1992). 


AN INITIAL SYNTHESIS Some would argue that whatever coherence exists 
flows from such externally available sources, i.e. that cultural coherence is 
entirely external to the person. As we have seen, however, such a position 
pushes the healthy shift from the latent-variable to the toolkit one step too far. 
Instead, the research reviewed here suggests that culture works through the in- 
teraction of three forms. First, we have information, distributed across persons 
(Carley 1991). Such distribution is patterned, but not highly differentiating, 
due to the indiscriminant manner in which bits of culture are accumulated and 
stored in memory (Gilbert 1991). Second, we have mental structures, espe- 
cially schematic representations of complex social phenomena, which shape 
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the way we attend to, interpret, remember, and respond emotionally to the in- 
formation we encounter and possess. Such schemata are more clearly socially 
patterned than are memory traces. Finally, we have culture as symbol systems 
external to the person, including the content of talk, elements of the constructed 
environment, media messages, and meanings embedded in observable activity 
patterns. 

Culture inheres not in the information, nor in the schemata, nor in the sym- 
bolic universe, but in the interactions among them. A.s we have seen, schemata 
structure our use of information. But people acquire many schemata throughout 
their lives, and some of these are inconsistent both in content and in implica- 
tions for behavior. How is it that people invoke one among the many schemata 
available to them in a given situation? 

To simplify greatly in order to focus upon the aspect of the process most 
relevant to the sociology of culture, selection is guided by cultural cues avail- 
able in the environment. Although a few schemata may be chronically avail- 
able, more often they are primed or activated by an external stimulus or frame 
(Sedikides & Skowronski 1991, Barsalou 1992, Gamson 1992:6—8, Schud- 
son 1989). Framing effects in social surveys—e.g. the finding that whites are 
more likely to accept negative stereotypes of African-Americans if the question 
is preceded by a neutral reference to affirmative action (Sniderman & Piazza 
1993:102—104)—Aare familiar examples. But schemata can also be activated 
through conversation, media use, or observation of the physical environment. 
Understanding the interaction between two distributrons—of the schemata that 
constitute people's cultural toolkits, and of external cultural primers that act 
as frames to evoke (and, in evoking, exerting selection pressures upon) these 
schemata—is a central challenge for sociologists of culture. 


APPLICATIONS 


This section reviews work on cognitive aspects or the sociology of culture 
in light of the perspective developed here. The topics are identity, collective 
memory, social classification, logics of action, and framing. 


Identity 

Identity has become one of the most active research fields in the sociology of 
culture. It is useful to distinguish between two quite different kinds of collective 
identity: the identities of collectives, on the one hand, and collective aspects of 
the identities of individuals on the other. 


IDENTITIES OF COLLECTIVES At the supra-individual level, collective iden- 
tity is a shared representation of a collectivity. Reszarch at this level portrays 
collective identities as highly constructed (Anderson 1983), through explicit 


CULTURE AND COGNITION 275 


messages and more subtle elements such as anthems and flags (Cerulo 1994). 
Collective identities are chronically contested, as groups vie to produce social 
representations capable of evoking schemata favorable to their ideal or material 
interests (Moscovici 1984, Zerubavel 1994, Friedland & Hecht 1996). 
Another line of research, active in both psychology and sociology, views 
identities and selves as collective representations that vary cross-culturally and 
historically. Markus et al (1996) review research on differences in the cultural 
construction of identity in East Asian and Western societies. Meyer & Jepperson 
(1996) contend that the modern self (and its variations in different polities) is a 
constructed identity endowed with agency in relation to the collectivity. 


COLLECTIVE ELEMENTS IN INDIVIDUAL IDENTITIES Much research on collec- 
tive identity is actually about the more complex issue of the ways in which social 
identities enter into the constitution of individual selves. Social identity theory 
views individual identities as comprising prioritized identity-sets based on par- 
ticularistic and role-based group affiliations (Stryker 1986). Self-categorization 
theories also portray collective identities as invoked by conditions that make 
particular identities especially salient (Tajfel & Turner 1986). In this view, 
individual identities reflect elaborated group-identity schemata in proportion to 
strength and recency of activation. Viewing identities as context-dependent in 
this way is consistent with observations of the volatility with which identities 
may gain and lose salience during periods of intergroup conflict. 


Collective Memory 


Collective memory is the outcome of processes affecting, respectively, the 
information to which individuals have access, the schemata by which people 
understand the past, and the external symbols or messages that prime these 
schemata. Like collective identities, research on collective memory portrays 
the phenomenon in both supra-individual and individual terms. 

Several scholars have studied institutional processes that maintain or suppress 
information as part of public culture, such as factors determining the reputation 
and popularity of particular persons or art works (Fine 1996, Griswold 1986, 
Lang & Lang 1988). Much research, however, focuses upon the schematic level, 
studying struggles to define the ways in which members of a society interpret 
widely shared information about their past, either tracking change in the ways 
in which a person or public figure is understood over time (Schudson 1992, 
Schwartz 1991) or analyzing conflict over alternative visions of a collective 
past (Maier 1988, Zerubavel 1994). 

Little research has focused on the interaction between individual and col- 
lective memories. An exception is the work of Schuman & Scott (1989), who 
use survey methods to explore the possibility that the historical events that 
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men and women of different generations remember most vividly structure their 
understanding of contemporary social issues. 


Social Classification 


The study of social classification—the social construction and use of category 
schemes—has burgeoned in the last decade. Some work has analyzed processes 
of classification in historical time, describing the errergence of a strongly clas- 
sified artistic high culture (DiMaggio 1982), or the use of social categorization 
in the formation and implementation of social policies (Starr 1992). Of particu- 
lar interest is Mohr's (1994) analysis of “discourse roles,” which uses structural 
equivalence analysis to identify the implicit classification of social problems 
and client groups embedded in self-descriptions of social-service and poverty- 
relief organizations in early twentieth-century New York City. 

Other research has focused upon social differentiation in shorter time spans. 
Zelizer (1989) describes the process by which women find ways to differenti- 
ate even money, the universal medium of exchange, in order to imbue it with 
social meaning. Lamont (1992) analyzes the bases upon which men of differ- 
ent regional and national origins make social distiactions that reinforce their 
sense of social honor. Gieryn (1997) describes boundary work within scientific 
communities, examining how scientists respond when the strong classification 
science/nonscience is threatened. 

Zerubavel, one of few sociologists to study classification from a cognitive 
perspective, points out that the drive to partition a continuous world appears 
to be a human universal, though the nature of the categories constructed may 
vary significantly among groups (Zerubavel 1991, 1297, Douglas 1966). Rosch 
(1978), whose work has dominated psychological thinking on the topic, pro- 
poses (with much experimental support) that cognition is most efficient when 
we chunk many separate features (bits of information) together by thinking 
with a prototype (complete mental image) of an object. Prototypical constructs 
emerge at the most efficient level of abstraction: 1.e. where an increase in speci- 
ficity provides the greatest marginal increase in information. Thus we have 
prototypes for "chair" but not "furniture" or "divan," and for "bird" but not 
for "animal" or “sparrow.” Although the level at which object prototypes form 
appears to be relatively universal, the specific content of a prototype reflects a 
mix of typicality and availability in a given location (D’ Andrade 1995). 

Rosch applied her model of prototypes to relatively simple concepts. Self- 
categorization theory draws on the prototype model (Hogg & McGarty 1990), 
but it remains to be seen 1f complex social constructs are represented in such 
unambiguous terms. If so, application to role analysis may be useful, in light 
of an intriguing parallel between Rosch's characterization of a prototype as a 
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core of essential features and Nadel’s (1957) classic definition of social role as 
consisting of a core of entailments and a penumbra of optional features. 


Logics of Action 

Many authors have used the expression “logics of action” to refer to an inter- 
dependent set of representations or constraints that influence action in a given 
domain. Sometimes, of course, the term is used as a synonym for “ideal type” 
(Orru 1991) or, in rational-actor approaches, to refer to situational constraints 
that induce parallel behaviors among players with similar resources given par- 
ticular rules of the game (Block 1990, Offe 1985). 

A richer, more cultural, sense of logics has emerged in recent work in political 
economy, a view that embeds them in the interaction between mental structures 
instantiated in practical reason (Bourdieu 1990), on the one hand, and institu- 
tional requirements on the other. Friedland & Alford (1991:248—49) provide 
the most thorough exposition and definition, describing “institutional logics” 
as sets "of material practices and symbolic constructions" that constitute an 
institutional order's “organizing principles" and are “available to organizations 
and individuals to elaborate.’ According to Friedland & Alford, these logics 
are “symbolically grounded, organizationally structured, politically defined and 
technically and materially constrained." 

Similar imagery is apparent in Boltanski & Thévenot's notion of modes 
of justification (1991), institutionally linked discourses embodying specific 
orientations toward action and evaluation. Empirical development of similar 
ideas can be found in Fligstein's (1990) work on "conceptions of control" in 
corporate governance, and in Stark's (1990) analysis of shop floor politics in a 
Hungarian socialist factory. 

Such work requires a taxonomy of institutions, each of which entails a dis- 
tinctive logic. (For Friedland & Alford, the institutions are capitalism, the 
state, democracy, family, religion, and science, each of which has its own ax- 
ial principle and linked routines and rituals.) Conflict erupts from the clash 
of institutional logics, as when a wife views her household labor through a 
marketplace logic of explicit exchange, whereas her husband imposes a family 
logic of selfless service upon the situation. 

The notion of logics is immensely appealing. First, it proposes that external 
rituals and stimuli interact with internal mental structures to generate routine 
behavior. Second, it is consistent with the view that culture is fragmented 
among potentially inconsistent elements, without surrendering the notion of 
limited coherence, which thematization of clusters of rituals and schemata 
around institutions provides. Third, it provides a vocabulary for discussing 
cultural conflict as confrontation between inconsistent logics of action. 
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At the same time, the work remains frankly exploratory and calls attention 
to gaps in our current understanding of culture and cognition, which neither 
psychology nor sociology can address. These are the topics of the next section. 


KEY PROBLEMS IN THE STUDY 
OF CULTURE AND COGNITION 


The notion of institutional logics can be reinterpreted as an effort to thematize 
schemata and link them to social structure. In order to exploit the insights this 
perspective offers, students of culture need three th:ngs that we now lack: an 
understanding of how schemata aggregate to more complex cultural structures, 
or “logics”; an understanding of cultural change, which, in turn, requires a clear 
understanding of the way in which actors switch among institutional logics; and 
a theory of analogy, which is necessary if we are to understand processes of 
schematic generalization that thematization and switching both require. 


Models of Schematic Aggregation 


Perhaps the highest priority for students of culture and cognition is to develop 
models of thematization, by which I mean the ways in which diverse schemata 
aggregate to more general and sociologically interesting constructs like thought 
styles, stories, logics, paradigms, and 1deologies. There are several candidates 
for such models. 


ATOMISTIC DECOUPLING The null hypothesis is that everyday thought is pop- 
ulated by randomly invoked, loosely coupled schemata with little or no higher- 
level architecture. If so, thematization is simply imposed post hoc by cultural 
specialists or embedded in the environment and in everyday routines. Although 
this view is inconsistent with most work in the sociology of culture, and would 
seem ill-equipped to explain either experimental research on schemata or macro- 
cultural change, it cannot now be disconfirmed absolutely. 


NESTED HIERARCHY At the opposite extreme is the view of cross-cultural 
psychologists that culture comprises a hierarchy of nested schemata, arrayed 
from abstract to concrete, with the latter entailed by the former. For exam- 
ple, Markus & Kitayama (1994) view a wide ranze of cognitive differences 
between Japanese and Americans as flowing from fundamental differences 1n 
self-schemata. Although they provide compelling evidence of significant in- 
tergroup differences, one need not assume as much coherence as they do. 


DOMAIN-SPECIFICITY There is considerable evidence that information and 
schemata pertaining to different life domains is stored in distinct areas of mem- 
ory, with schematic integration occurring within specific domains (Hirschfeld 
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& Gelman 1994). In this view, clusters of schemata are coherent only within 
limited boundaries; taken together, the domains are “more like the collected 
denizens of a tide pool than a single octopus" (D' Andrade 1995:249). 

This view has considerable experimental support, though there is little con- 
sensus as to the size or character of the domains. It is tempting to equate 
“domain” with the institutional realms identified by Friedland & Alford (1991) 
or Boltanski & Thévenot (1990), and to posit that culturally specific “logics 
of action" are thus embedded in schematic organization, but there is at present 
little if any empirical warrant for doing so. 


IDENTITY CENTRALITY Some evidence suggests that affectively hot schemata 
are more salient and have more extensive entailments than do emotionally neu- 
tral structures. Work on identity (Wiley & Alexander 1987, Hogg & McGarty 
1990) suggests the possibility that “the self" may be an emotionally supersat- 
urated cluster of schemata tending toward consistency and stability over time. 
Schemata that are embedded in the self-schemata, then, are more closely artic- 
ulated with other schemata than those that are not incorporated into the self. 


ROLE CENTRALITY By analogy, one can view roles as situationally evoked, 
emotionally activated, partial identities that provide integrated chunks of schem- 
atic organization and permit compartmentalization of different cultural contents. 
This perspective is appealing because it identifies a mechanism (i.e. role ac- 
tivation) connecting schematic triggering to contextual variation, and because 
it is consistent with evidence for domain-specificity of schematic organization. 
Moreover, because roles are embedded in distinctive role relations, this view 
points toward an integration of cultural and network analysis within a single 
framework (McCall 1987). 

Which of these models of schematic thematization best describes the pro- 
cesses by which people integrate schemata is at present anybody's guess. Sig- 
nificant matters—the extent to which ideology enters into conscious experience, 
the patterning of cultural styles or orientations, and the stability of cognition 
across context—ride on its resolution. 


Cultural Change 


A second priority for sociologists of culture is to create theories of cultural 
change that integrate ideas from research on culture and cognition with macroso- 
ciological perspectives. At least four different change processes are crucial to 
understand. 


THEORY OF ENVIRONMENTAL TRIGGERING I have argued that culture enters 
into everyday life through the interaction of environmental cues and mental 
structures. I have further suggested, by combining logic-of-action theories in 
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sociology and domain-specificity theories in psychalogy, that cultural under- 
standings may be fragmented by domain, so that when persons or groups switch 
from one domain to another, their perspectives, attitudes, preferences, and dis- 
positions may change radically. It follows that large-scale cultural changes may 
be caused by large-scale, more-or-less simultaneous frame switches by many 
interdependent actors. 

At the micro level, we need a better understand:ng of how and why peo- 
ple switch among frames, logics, or domains (White 1995; from a rational 
choice perspective, Lindenberg & Frey 1993), The paradigmatic work on this 
comes from language, where research on code-switching has documented the 
circumstances (ordinarily changes in context, conversation partner, or topic) 
that trigger change in language or dialect (Gumperz 1982). At the macro level, 
the challenge is to create models that link environmental change to patterns of 
switching (White 1995). 


THEORY OF SCHEMA ACQUISITION, DIFFUSION, AND EXTINCTION  Psycholo- 
gists have cast substantial light on the acquisition of schemata by individuals 
during development (Nelson & Gruendel 1981, Hirschfeld 1994). Sociolo- 
gists of culture should turn their attention to factors leading to change in the 
distribution and level of activation of cultural representations or schemata in 
the population. Such change may occur if differert cohorts acquire particular 
schemata at varying rates; or if changes in the distribution of environmental 
cues lead to enhanced activation or deactivation of particular schemata that 
have already been acquired. 

Diffusion models of the sort that have been tsed to study the effects of 
media exposure on the adoption of new technologies or beliefs may be useful. 
Diffusion should be most effective where resonance exists between the new 
cultural element and existing schematic organization (Sperber 1985). 

Work in the historical sociology of culture provides some guidance. 
Wuthnow's (1989) macro-theory of ideological change, which points to the ım- 
portance of ecological effects on the life chances of new beliefs, may be usefully 
transposed to more micro levels. Tilly (1992) has developed and implemented a 
valuable approach to studying change over time in zontentious movement reper- 
toires. Buchmann & Eisner (1996) present evidence of accelerating change 
in the public presentation of selves during the second half of the twentieth 
century. 

A particular challenge is to understand cognitive aspects of major collective 
events in which large numbers of persons rapidly adopt orientations that might 
have appeared culturally alien to the majority of them a short time before. Some 
religious revivals, the emergence of capitalism after the fall of the Soviet Union, 
and some spirals of ethnic antagonism are demanding cases of this kind. 
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THEORY OF DELIBERATIVE OVERRIDING Itis important to understand not only 
how culture constrains, but how persons and groups can transcend the biasing 
effects of culture on thought. Work on this problem by psychologists (noted 
earlier) must be supplemented by research on the types of social interaction 
that lead large numbers of people to question and, ultimately, to revise their 
schematic representations of social phenomena. 


Analogy and Generalization 


Related to the study of change, but so important that it warrants a section of its 
own, is the problem of analogy and generalization. Sociological theories that 
portray persons as actively incorporating culture into cognitive organization 
invariably rely on some notion like the habitus, which Bourdieu (1990) refers 
to as a “system of durable transposable dispositions.” The key question for 
all of these theories is: Under what conditions are dispositions or schemata 
abstracted and transposed from one domain to another? 

Almost all cultural change entails the transfer of some body of ideas or 
images from one content area to another on the basis of similarity judgments. 
Indeed, any attempt to characterize the culture of a group or a people in abstract 
terms—i.e., any analytic effort at thematization—takes for granted that actors 
have the capacity to draw analogies between classes of objects, actors, events, 
or actions, and thereby to understand them in similar ways. 

Think of culture as a network of interrelated schemata, with analogies as the 
“ties” that create paths along which generalization and innovation occur. How 
are new “ties” created? The literature provides at least three alternatives. 


FEATURE CORRESPONDENCE Inthe most straightforward models, two schem- 
ata or related structures lend themselves to analogy (and thus to generalization 
across domains) insofar as they share particular features (Lakoff & Johnson 
1980) that create a correspondence between them. Thus Swinburne’s line, 
“when the hounds of spring are on winter’s traces," is meaningful because 
of the correspondence between temporal and spatial pursuit and between the 
destructive effects of hounds on hares and of spring on winter. Two problems 
with this view are that the correspondence itself ts constructed rather than innate; 
and that analogical power would not seem to vary with the extent of overlap 
between tenor and vehicle. 


STRUCTURE-MAPPING This view takes as its starting point the existence of 
some form of content-related domain-specificity. Analogies connect not sim- 
ply schemata but whole domains (Tourganeau & Sternberg 1982), deriving their 
power from the network of entailed comparisons they trigger. The most pow- 
erful analogies connect domains that are structurally homologous. Put another 
way, generalizability across domains is a function not of the extent to wbich 
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they share particular features in common, but of the extent to which relations 
among features are structurally similar (Gentner 1983). 


EMOTIONAL RESONANCE Someresearch suggests that affectively hot schemata 
are more likely to be generalized across domains than affectively neutral sche- 
mata. For example, analogies are likely to be drawn between situations that 
elicit strong emotional reactions of a similar kind (Abelson 1981:725). 


POLYSEMY AND SEMANTIC CONTAGION A final possibility 1s that polysemous 
expressions—those with distinct meanings that resonate with multiple schemata 
or domains—facilitate analogical transfer. Bakhtin's work (1986) on textual 
multivocality is suggestive in this regard, as is White's (1992) work on stories 
and rhetorics. Ross (1992) portrays meaning as emerging from the relations 
of words to one another in speech and to activities in real time. Because 
these constantly change, meanings are rarely fixed, but instead adapt, diverge, 
and spread across domains through semantic contagion. This perspective is 
particularly attracuve because it acknowledges endemic change in language and 
other symbol systems and because ıt embeds generalization in social interaction. 


SYMBOLS, NETWORKS, AND COGNITION 


Cognitive aspects of culture are only one—and not necessarily the largest —part 
of the sociology of culture's domain. But it is a part that we cannot avoid if we 
are interested in how culture enters into people's lives, for any explanation of 
culture's impact on practice rests on assumptions about the role of culture in 
cognition. I have argued that we are better off if we make such models explicit 
than if we smuggle them in through the back door and that work in cognitive 
psychology and social cognition, although animated by different questions, 
offers tools that we sociologists can use to pursue our own agendas. 

Ultimately, the challenge is to integrate the micro perspectives on culture de- 
scribed here with analyses of cultural change in larger collectivities over longer 
stretches of time. I have argued for a perspective that privileges schemata and re- 
lated constructs as units of analysis, and attends to mechanisms by which phys- 
ical, social, and cultural environments differentially activate these schemata. 

This argument has begged the question of which aspects of the environment 
are most worthy of study. Without denying the unquestionable importance of 
research on how media and activity structures interact with subjective cultural 
representations, I shall conclude by calling brief attention to new research on the 
relationship of cognitive and symbolic phenomena to social structures portrayed 
as social networks. 

Some researchers have focused on cognitive representations of social struc- 
ture. [Fiske & Linville (1980) claim that schema theory is especially relevant 


CULTURE AND COGNITION 283 


to the representation of social phenomena; and see Howard (1994).] The idea 
that social structures exist simultaneously through mental representations and 
in concrete social relations was central to Nadel’s (1957) role theory. Both 
theorists (Emirbayer & Goodwin 1994, Orr 1995, White 1992) and researchers 
(Krackhardt 1987) are exploring the implications of this view. 

Networks are crucial environments for the activation of schemata, logics, and 
frames. In a study of the Paris Commune, Gould (1995) argues that political 
protest networks did not create new collective identities, but rather activated 
identities that communards already possessed. Bernstein (1975) demonstrates 
the impact of network structures on individuals' tendency to employ cognitive 
abstraction. Erickson (1996), studying security guards, finds a correlation be- 
tween the complexity of social networks and the diversity of conversational 
interests. Vaughan (1986) describes how people questioning marriage alter 
customary patterns of social relations in order to create new, independent iden- 
tities as prologue to separation. Such studies point to a new, more complex 
understanding of the relationship between culture and social structure built 
upon careful integration of micro and macro, and of cognitive and material, 
perspectives. 
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ABSTRACT 

Women's entrance into the labor market in large numbers has exacerbated incom- 
patibilities between employer and family interests. Research reveals that conflict 
between paid work and family responsibilities has been linked to decreased em- 
ployee productivity as well as decreased family functioning. In this review, we 
explore the nature of job/family incompatibility, organizational interests in family 
responsive policies, and the current prevalence of various policies within work 
organizations. We then review what is known about the effectiveness of particular 
family-responsive policies on organizational and family functioning. Finally, we 
consider barriers to further institutionalization of family responsive policy and 
suggest future research and policy directions. 


INTRODUCTION 


As the number of dual-earner and single-parent households raising children 
continues to grow, pressure on organizations to attend to the family responsi- 
bilities of employees has been increasing (Families and Work Institute 1991, 
Goodstein 1994). The sexual division of labor spawned by the development of 
industrial capitalism in the nineteenth century has given way to a new demo- 
graphic reality—one in which the responsibilities of workers to provide primary 
physical care to their dependents are no longer segregated from their responsi- 
bilities to provide financially for their families. The US Census Bureau reported 
that 5296 of mothers of infants under one year of age were in the labor force as 
of 1991 (US Bureau of the Census 1992), while projections indicate that over 
8596 of employed women can expect to become pregnant at some point during 
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their work life. Indeed, some demographers argue that 90% of women aged 
18-49 should be considered active members of the labor force, since even those 
not currently employed will soon be again (Stipp 1988). 

These high rates of female labor-force participation affect men as well, since 
men are increasingly partnered to women who are likely to be lifetime labor 
force participants. Without housewives at home to attend to the organization 
and provision of care to children, fathers are also experiencing tensions between 
their work and family obligations, although not necessarily the same kind nor 
to the same degree as mothers (Gerson 1993). Judging from the increased 
proportion of employed men reporting child-care pro»lems in 1988 compared 
to 1984, a trend toward greater paternal involvement in family management 
may be emerging (Fernandez 1990). 


HISTORICAL SOLUTIONS 


The problem of reconciling family caregiving with paid employment is not 
new in the history of industrial capitalism. In the ninsteenth century, factories 
were sometimes filled with entire families working at the same location but at 
different tasks, justified in part by the need to adequately supervise and monitor 
children (Hareven 1982). But this pattern did not persist into the twentieth 
century as the supply of male labor increased and the effects of harsh working 
conditions on the health of mothers and children became a concern (Brenner 
& Ramas 1984). The family responsibilities of those women still compelled to 
work outside the home were addressed through protective labor legislation in 
the early twentieth century (Kessler-Harris 1982). 

Throughout this period, men's roles as husbands and fathers were not ignored 
by unions and industrialists but were mostly redefined as only breadwinning 
responsibilities. Thus, some large companies in the early decades of the twen- 
tieth century competed for the loyalties of their predominantly male workers by 
offering a smorgasbord of benefits such as health, life, accident insurance, and 
pensions in a system that has become known as “welfare capitalism" (Edwards 
1979). Because welfare capitalism proved to be expensive and failed to stop 
union activity, however, most programs did not survive the Great Depression 
(Edwards 1979). 

World War II brought with it a resurgence of interest in work/family issues 
as the number of women workers in crucial defense industries dramatically in- 
creased. Federally funded child-care centers were opened near defense-related 
factories, temporarily setting aside the ideology of separate spheres for men 
and women. Although those child-care centers were closed soon after the war 
ended, some scholars have argued that the war nevertheless ushered in an era 
in which unions pushed for and corporations increasingly provided social wel- 
fare benefits (Cornfield 1990), mirroring many of the benefits offered under 
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welfare capitalism—health and life insurance, pension plans, and disability 
protection. Importantly, however, this postwar expansion included federally 
mandated employer participation in such areas as workers’ compensation and 
Social Security, institutionalizing the notion that employers had at least some 
obligation to provide for the security of the families of their employees. 

Yet the degree to which employers rather than the state should be held respon- 
sible for the social welfare of families would continue to be contested. In the 
United States, to a far greater extent than in Western Europe's social democra- 
cies, employers have been allowed to vary 1n the degree of responsiveness they 
show to workers' family obligations. Strong welfare states in Western Europe 
assumed much greater responsibility for ensuring that employers contributed 
to the well-being of families, through legislative and tax policies that provide 
paid leaves and reduced work hours for parents of young children and that re- 
distribute income to families with children. The early dependence of Buropean 
economies on female labor after World War II resulted in explicitly fashioned 
"family policies" that were designed to protect birth rates while simultaneously 
incorporating mothers into the labor force (Haas 1991). 

In the United States, by contrast, dependence on married women's labor has 
come relatively late, and employers have mostly avoided regulatory policies 
that assist parents. Instead, the US welfare state has been fashioned around di- 
rect provision of services to families with heads unable to secure employment, 
financed through general tax revenues. As of this writing, even that minimal 
guarantee of government assistance to families with children was being deci- 
sively limited by federal welfare reform legislation. Yet, legislation that would 
force employers to accommodate childbearing and family caregiving, or even 
to sponsor affordable high-quality substitute care while parents work has not 
been swiftly forthcoming. For instance, the Family and Medical Leave Act of 
1993 was twice vetoed by Republican administrations. The version of federally 
mandated parental leave that ultimately was signed into law shortened the orig- 
inally proposed leave and contained provisions that exempted small businesses 
and employees in any business whose job duties were considered essential to 
the organization's operation. 

Some scholars believe nevertheless that the tide is slowly turning in favor of 
government intervention 1n the work/family relationship, pointing to the pro- 
liferation of bills in the US Congress dealing with work/family issues in recent 
years (Burstein et al 1995). Although few have yet been passed, the degree of 
interest certainly suggests possible expansion of the state's interest in regulating 
the employment of parents of minor children. And the state's abandonment of 
guaranteed support to parents of minor children might paradoxically increase 
pressure on lawmakers to legislate solutions to job/family conflict. At present, 
however, neither government nor employers in the free market have institu- 
tionalized policies or procedures for dealing with the influx of workers with 
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caregiving responsibilities into the labor force. No new model has emerged 
of a relationship between labor, management, and the state that would loosen 
managerial control over the hours and scheduling of work, allow time for child- 
bearing, and guarantee adequate substitute care for young children. 

Part of the problem stems from the fact that children need different types 
of parental care and supervision at different ages. This problem of hetero- 
geneity in parents’ needs is exacerbated by continued ideological debate over 
the appropriate approach to solving job/family conflicts—while some feel that 
employers should free workers’ time and energy to cere for their own children, 
others promote the transfer of caregiving functions from workers to other insti- 
tutions such as schools, child-care providers, hospitals, and nursing homes, etc. 
These divergent perspectives suggest different policy alternatives—extended 
parental leave for newborns versus infant day care, paid sick days for family ill- 
ness versus sick-child care, reduced or part-time worx hours for parents versus 
extension of the school day and school year. 

Another reason that employers and government have been slow to respond 
to the needs of parents has been the lack of information about the most effica- 
cious and cost-effective policies to implement. Rigorous evaluation of various 
workplace policy initiatives designed to reduce job/family stress has, in our 
opinion, been conspicuously missing. The effects of family responsive policies 
on parental behavior and children's well-being have not yet received direct at- 
tention, although child development experts contributed heavily to the debate 
on mandated parental leave by summarizing the effects of mothers' early re- 
turns to work on the physical and emotional well-being of mothers and infants 
(Brazelton 1986, Hopper & Zigler 1989). 

The remainder of this essay will (a) attempt to develop a conceptualization of 
"family responsiveness" that is rooted in an analysis of the problems employed 
parents face as well as the organizational problems that ensue from parents’ 
caregiving responsibilities, (b) review the prevalence of policies designed to 
assist employed parents and the determinants of organizational adoption of 
workplace family policies, and (c) evaluate the extent to which various policy 
initiatives achieve their objectives, whether those relate to family well-being, 
worker satisfaction, or organizational productivity. We then conclude by dis- 
cussing the disjuncture between policies that are mcst likely to solve the dilem- 
mas faced by employed parents and the solutions that employers are likely to 
adopt strategically without legislative coercion. 


THE CHANGING AMERICAN WORKER 


Ten years ago, in their review of literature in the then-burgeoning field of 
work/family conflict, Greenhaus & Beutell (1985) defined work/family conflict 
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as “a form of interrole conflict 1n which the role pressures from the work and 
family domains are mutually incompatible in some respect" (p. 77). We look 
to the survey data on employed parents to determine the sources of job/family 
conflict as well as the working conditions and policies most desired by parents 
to ameliorate those conflicts. 


What Do Parents Want 


It is difficult to state definitively what employees desire in terms of family ac- 
commodations. Some studies assessing employees' preferences for workplace 
initiatives toward reducing work/family conflict sample all employees, while 
others sample only those with current child-rearing responsibilities. Impor- 
tantly, the family needs of employees are not homogenous or static. While 
childbearing employees need leave for childbirth and infant care, parents of 
preschoolers need high-quality affordable child care and reduced work hours to 
meet the emotional needs of young children. Parents of school-aged children 
need after-school, vacation, and summer care, while parents of teens and care- 
givers of elderly parents need schedule flexibility and leave for emergencies. 
In a study on the utility of child-care programs, Kossak (1990) showed that 
these different family and work factors mean that the policies that are optimal 
for one class of employees at one point in time may have little to no effect on 
reducing work/family confiict for another class, or even for the same employees 
at another point in time. 

Keeping these methodological caveats in mind, we can draw a general picture 
of parents’ preferences. In a USA Today poll (cited in Vanderkolk & Young 
1991) in which employed parents were asked what job benefits they considered 
to be most important, 409€ of mothers and 2196 of fathers replied that family 
benefits were the most important job benefits. Another 1096 of mothers replied 
that flexible hours were most important, while 596 of fathers responded in this 
way. Together, this means that 5096 of mothers and 26% of fathers valued 
family responsive policies in their jobs more than they valued pay (64% of 
fathers and 42% of mothers considered pay to be most important). 

The National Study of the Changing Workforce, utilizing a nationally rep- 
resentative sample of 2958 employees, revealed that close to one fourth of 
employees lacking flexible schedules or the ability to work at home would 
change jobs to gain these benefits. Additionally, 4796 of those lacking leave 
time to tend to family illness said they would sacrifice pay or benefits to gain 
leave for sick family members. Finally, day-to-day family responsibilities were 
claimed by 8796 not just of parents but of all surveyed. The study says this sug- 
gests that employee pressure on businesses to become more family-responsive 
is indicative not simply of special interests, but of widespread need in the labor 
force (Social Science and the Citizen 1993/94). 
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In another nationally representative study, respondents indicated that their 
quality of family life would be improved if, first, they were to receive merit 
raises, and second, they were provided with trained, family-accommodating 
supervisors (Galinsky & Stein 1990). In a sample of business and university 
employees, most indicated that some form of schedule flexibility or work reduc- 
tion, such as flextime, part-time work, or parental leave, was their top choice 
toward 1mproving the work/family balance. Rogers (1992) reported that his 
data from 20 Fortune 500 companies showed the worx/family policy most con- 
sistently highly rated and desired by workers was full-time flexible scheduling; 
most employees with family responsibilities said thev did not want to, or could 
not afford to, work less. 

The types of family needs addressed through workplace initiatives fall into 
three general categories: (a) policies and benefits that reduce work hours to 
provide time for family caregiving through the provision of leave for vacation, 
illness, childbearing, and emergency child care or through reductions in average 
hours worked per week, (b) policies designed to give workers greater flexibil- 
ity in the scheduling of work hours and the location of work hours while not 
decreasing average work hours, and (c) policies designed to provide workplace 
social support for parents, including forms of child-care assistance so that they 
Work without concern for the care of dependents in their absence (for a similar 
categorization, see Silver & Goldscheider 1994). This categorization provides 
a useful heuristic for organizing the vast findings concerning individual-level 
and family outcomes of work/family conflict as well as for evaluating the ef- 
fectiveness of organizational work/family policies. 

Evidence of job/family conflict can be found both at the workplace and at 
home. Spillover from work to family affects the family by impairing both 
individual and family functioning. Spillover from the family to work often 
takes the form of lower productivity, higher absenteeism, and greater turnover. 
This family-to-work spillover can result in lowered career achievement for 
the employee. In the next section we investigate research on work-to-family 
spillover and its effects on family functioning and irdividual well-being. Then 
we review the available research on family-to-work spillover, focusing on the 
consequences for individual achievement and organizational functioning. 


The Effects of Work/Family Spillover 


Research has demonstrated fairly consistent links between work/family conflict 
and physical and mental health as well as to some aspects of family function- 
ing such as marital satisfaction and parenting behaviors. However, measures of 
work/family conflict often rely on parents’ self-reports of conflict levels (Bohen 
& Viveros-Long 1981, Bedeian et al 1988, Voydanoff 1988, Frone et al 1992) 
and generally fail to distinguish among different sources of job/family conflict. 
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Because these measures combine conflict stemming from overwork, schedule 
inflexibility, and unsupportive work environments, it is hard to tell how partic- 
ular sources of conflict affect various work and family outcomes. For example, 
Hughes and associates (1992) showed that nonspecific job/family incompati- 
bility decreased marital companionship and increased marital tension. Studies 
using other measures, such as overtime hours or difficulties arranging child care 
(Kossak 1990), provide insight into the effects of specific types of work/family 
conflict. The individual and family outcomes we discuss are organized around 
the categorization of work/family issues outlined above—extended work hours, 
inflexibility, and unsupportive work environments. 


WORK HOURS Extended work hours have been linked to work/family conflict 
(Piotrkowski et al 1987). Frone et al (1994) also demonstrated a link between 
lack of family time and compulsive drinking and smoking in employed mothers. 
While the relationship between Jack of time and marital solidarity/satisfaction 
remains unclear (Piotrkowski et al 1987), some evidence suggests that parents’ 
lack of time serves to diminish children’s well-being. The increase in time spent 
in work results in a decrease in time spent with children (Nock & Kingston 1988) 
and an increase in the time children spend in substitute care, much of which is 
of relatively low quality in the United States (Cost, Quality & Child Outcomes 
Study Team 1995). Although decreased quantity of time spent with children 
does not necessarily indicate that the quality of interactions with children is 
decreased (Piotrkowski et al 1987), some scholars wonder whether there are 
upper limits to work hours above which even “quality time” suffers (Louv 1990, 
Hewlett 1991). Parents are aware of their sacrifice of time with their children to 
engage in paid employment. Schor (1996), for example, reported an increase 
in parents’ interest in limiting their work time in recent surveys. Although we 
know that the lack of sensitive, responsive, and consistent care from overworked 
parents or substitute providers can lead to decreased cognitive and social skills 
(Parcel & Menaghan 1994) and can promote attachment insecurity in children 
(Belsky 1990), research is still unclear about the effects this time sacrifice may 
have on children. 


FLEXIBILITY Lack of workplace flexibility has been linked to depression in 
both women and men (Googins 1991), and to increased physical distress such as 
difficulty in falling asleep and in staying asleep, changes in appetite, tension- 
related aches and pains, etc in men (Guelzow et al 1991). Ralston (1990) 
demonstrated that the proportion of employed women with rigid work schedules 
reporting difficulties in child-care arrangements and inadequacies in family time 
was much higher than the proportion of women with more flexible schedules 
who reported these problems, although significance levels were not reported. 
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WORKPLACE SOCIAL SUPPORT Finally, conflict arising from the lack of work- 
place social support, including child-care assistance. is manifested in the mental 
health of employees. Ross & Mirowsky (1988) demonstrated that employed 
mothers who had difficulty making child-care arrangements suffered increased 
depression. In a national survey, breakdowns in child-care arrangements served 
to decrease well-being for both mothers and fathers (Galinsky 1994). This study 
also revealed that workers in unsupportive work environments—those char- 
acterized by discrimination or favoritism—experienced more negative family 
consequences. 


The Effects of Family/Work Spillover 
Family-to-work spillover also impairs organizational effectiveness, primarily 
by affecting absenteeism, productivity at work, anc turnover. Child-care diffi- 
culties in particular commonly result in these problems. Parents report difficulty 
with several aspects of child care, such as “lack of time to deal with the very 
unpredictable nature of child care" (Fernandez 1990: 21), the availability of ad- 
equate care, and the affordability of such care (Fernandez 1990). Children who 
are barred when sick from attending their regular day care must have other ar- 
rangements made for them or stay home by themselves; breakdowns in regular 
child-care arrangements require unexpected time commitments to finding alter- 
native care. Although parents may be able to find such care, their search often 
results in time lost from work or decreased productivity. Of the approximately 
2300 employed mothers in the National Child Care Survey, 15% said they had 
either been late for work, left early from work, or missed a day of work in the 
month prior to being surveyed due to failure in their child-care arrangement 
(Hofferth et al 1991). Thirty-five percent of employed mothers reported that a 
child was sick on a work day during the past month, and 51% of these mothers 
stayed home to care for their child, resulting in an average of 2.2 days missed 
during the month for a sick child. Fernandez (1990) reported that in a 1987/88 
survey of over 26,000 employees in diverse companies, 42% of women and 28% 
of men had missed work in the previous year due tc personal/family problems. 
Worker productivity is also impaired in other ways by family obligations. 
Fernandez (1986) reported that parents face work interruptions due to family 
responsibilities. These interruptions, like absenteeism, decrease as children 
grow older. Still, these interruptions were reported by 39% of mothers and 17% 
of fathers with children between ages 15 and 18 (Fernandez 1986). Fernandez 
(1990) intimated that interruptions probably occur more than parents report, but 
that parents make up for the infringement of family on work-time by working 
overtime or taking work home. Further, parental worry over sick children left 
alone at home, latchkey children who travel on their own from school to home 
to spend the afternoon alone, or the inadequacy or limited child-care choices 
has productivity consequences. One study showed that worry over children 
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resulted in wasted time and mistakes in 53% of participants (Perry 1982). When 
asked if work/family stress affected their ability to concentrate at work, 28% 
of men and 53% of women sampled from 20 Fortune 500 companies replied 
positively (Rogers 1992). Evidence from experimental studies also supports 
the idea that the family affects worker productivity. In a sample of mothers, the 
incompatibility of family and work life decreased concentration and alertness, 
which resulted in the decreased effectiveness of task-oriented work (Barling & 
MacEwen 1991). 

Another potential workplace outcome of family/work conflict is turnover. 
Felmlee (1995) found that among women, having preschool-age children was 
associated with greater rates of job changing and, in particular, job changing 
that resulted in downward mobility. Collins & Hofferth (1996) showed that the 
termination of child-care arrangements significantly increased the probability of 
employment exits among women with high per capita incomes. Other research 
reveals that women’s disproportionate responsibility in the home results in 
significantly more turnover because of family illness, household duties, and 
changes in residence (Spilerman & Schrank 1991). Glass & Riley (1996) found 
that women in jobs requiring excessive time commitments were significantly 
more likely to change jobs or exit the labor force within one year of giving birth. 
The evidence supporting the link between family/work conflict and turnover is 
substantial, although Burke (1989) failed to find a link between work/family 
conflict and turnover. 

The decreased productivity and increased absenteeism and turnover resulting 
from work/family conflict have serious consequences for the occupational at- 
tainment of workers and thus the financial stability of families. The experiences 
of women in the labor market are instructive. When family responsibilities ex- 
pand, mothers are more likely than fathers to change jobs, to work part time, 
or exit the labor force for a spell because families cannot afford to lose father's 
wages. The result is often a decrease in mothers' financial and occupational 
attainment (Felmlee 1995, Corcoran et al 1984). Because the sex gap in wages 
is increasingly explained by differences in the effects of marriage and children 
on men's and women's wages, Waldfogel (forthcoming) has suggested that we 
view work/family conflict as the culprit, either through the mechanism of em- 
ployer discrimination against mothers or employee behavioral adjustments that 
often result in curbed occupational attainment. 


ORGANIZATIONAL INTERESTS IN FAMILY 
RESPONSIVE POLICY 
Organizations also suffer when employed parents experience absenteeism, turn- 


over, and lower productivity (Spilerman & Schrank 1991, Goff et al 1990, Raabe 
1990, Fernandez 1986). Capowski (1996) reported that over 35% of departing 
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management associates at AT&T cited inability to balance work and family in 
their exit interviews; significant administrative and training costs were created 
when these employees had to be replaced. Increasingly, however, organizations 
are reporting difficulty recruiting employees, particularly managers and profes- 
sionals, in the absence of family responsive policies (Scott 1992). Indeed, one 
report indicated that over two thirds of the Fortune 500 companies have insti- 
tuted some type of family responsive program in crder to remain competitive 
in the skilled labor market (Families and Work Institute 1991). 


Prevalence of Family Responsive Policies 

in Work Organizations 

How prevalent are work/family programs at the present time? Organizational 
surveys differ in their sampling frames and definition of family responsive 
policy, but they generally agree on two points: (a) tremendous growth has 
occurred in the number of formal work/family prcgrams 1n the past 15 years, 
and (b) most large companies have by now instituted some type of initiative 
designed to address the family needs of workers for greater schedule flexibility, 
child-care services, and/or work hours reduction (Families and Work Institute 
1991, Friedman 1990, Hewitt Associates 1991). A distinction needs to be 
drawn, however, between those surveys conducted by the US Bureau of Labor 
Statistics on firms with over 100 employees and those conducted on very large 
organizations by private consulting firms, with the former finding far lower 
rates of family assistance than the latter. For example, in the hours reduction 
category, the Bureau of Labor Statistics 1989 Employee Benefits Survey shows 
that 4096 of employees held jobs covered by paic or unpaid maternity leave, 
while the 1991 Hewitt Associates survey of large employers found 5696 of 
employing establishments offering (paid or unpard) parental leave. Similarly 
the BLS notes that 11% of workers had flexible schedules available to them, 
while the Hewitt Associates survey reports that 5396 of establishments offered 
flexible schedules [a figure that rose to 7196 1n a 1995 Hewitt Associates survey 
of 681 large firms (Capowski 1996)]. Under the heading of social support, the 
BLS reported dependent-care reimbursement accounts (in which employees 
can set aside pre-tax earnings to pay for child or elder care) covered 2396 of 
employees (Cooley 1990), while Hewitt Associates found 9196 of firms offering 
dependent-care accounts. 

Work/family policies are strongly associated with firm size, a finding con- 
firmed by combined BLS statistics on benefit coverage from surveys of both 
large and small firms (small is defined as fewer than 100 employees). Grossman 
(1992) showed that maternity leave coverage dropped from 4096 to 3296 when 
small firms were included in the calculations; reimbursement accounts dropped 
from 23% to 18%. Using an earlier 1987 BLS special survey on child-care 
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benefits, Hayghe (1988) reported a clear association between firm size and the 
proportion of establishments offering any type of direct child-care assistance 
(defined as financial assistance in paying for care, on-site child care, or infor- 
mation and referral services). Eleven percent of establishments provided some 
type of child-care assistance overall, but that figure ranged from 9% for firms 
with under 50 employees to 32% for firms with over 250 employees. Using 
a broader definition of flexible scheduling (flextime, flexible leave, and volun- 
tary part-time schedules), the BLS survey on child care reported that 61% of 
establishments with over 10 employees provided some type of flexible work 
arrangement in 1987, but in contrast to other forms of assistance, there was no 
relationship between firm size and flexible scheduling (Hayghe 1988). 

While these figures overall suggest impressive progress in the organizational 
adoption of work/family benefits, these benefits are far less institutionalized 
than others designed to protect traditionally male breadwinning responsibili- 
ties. For instance, while 4096 of workers 1n the 1989 BLS survey received 
maternity leave in their jobs. 53% received leave for military service, 94% re- 
ceived life insurance, and 8196 received some type of employer provided pen- 
sion. Moreover, the extent and coverage of employer-reported family policies 
remain poorly understood. Survey questions on flextime, child-care assistance, 
and leave policies rarely explore the degree of employer commitment to the 
policy. Hyland (1990) showed, for example, wide variation in the duration 
of unpaid parental leaves for employees covered by leave policies in the 1989 
BLS survey. Hayghe (1988) and Hewitt Associates (1991) both showed that 
flextime, which alters the start and end times of the workday while maintaining 
an eight hour day/40 hour week, was far more common than more innovative 
scheduling policies such as compressed work weeks, job-sharing, and work-at- 
home arrangements. Child-care assistance is similarly diverse, with the most 
common forms of assistance being the cheapest and least helpful —dependent- 
care spending accounts and information and referral services, rather than on- 
or near-site child care or a direct employer subsidy to pay for care. Even 
within categories, employer commitment can vary. Some information and re- 
ferral services simply provide employees with lists of licensed care providers, 
while others utilize human resource staff to assist parents directly in screen- 
ing caregivers and locating openings. Direct employer subsidies for child-care 
expenses vary in the amounts committed, sometimes requiring the sacrifice of 
other benefits in "cafeteria style" benefit plans. 

Furthermore, the degree tc which access to family responsive policies is for- 
malized and extended to all employees in organizations remains suspect. In the 
BLS surveys, individual employees were not surveyed; instead, all employees 
were assumed to be covered rf any employees were covered, an assumption that 
has been challenged by various researchers. Kush & Stroh (1994) report that in 
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their survey of Chicago firms, most flextime programs were temporary ad hoc ar- 
rangements, often restricted to certain classes of employees and/or certain times 
of the year. Rather than having a permanent formal policy, many organizations 
implemented flextime in response to specific requests by individual employees. 
Capowski (1996) quotes one consultant as follows, “When companies say, ‘Oh 
we have flextime, we have telecommuting,’ what they mean is, ‘We have an in- 
dividual working here who does this.’ They don’t mean they have an integrated 
system.... The difference is that in a flexible company these options would 
be available to everyone, instead of employees having to cut deals.” Miller 
(1992) reports that women in professional and managerial positions are much 
more likely to receive family friendly benefits suck. as funded maternity leave, 
schedule flexibility, and child-care assistance thar are women in less skilled 
jobs. It may be that more highly skilled workers are concentrated in more 
responsive organizations or have access to formal benefits from which other 
classes of workers are excluded, but it is also possible that skilled workers are 
better able to individually negotiate special concessions from their employers 
because of their greater market power. Given the associations between fertility, 
education, and occupational status, this means that the mothers most in need 
of family accommodations (e.g. young single mothers with low earnings and 
little human capital) are least likely to receive them from their employers. 

Even the most optimistic observers of corporate climate admit that very few 
employers have moved beyond token acknowledzment of the family needs 
of employees (Galinsky & Stein 1990, Raabe & Gessner 1988). Moreover, 
the policies and practices of small firms, which employ over one third of the 
total US labor force and represent the strongest sources of future employment 
growth, show far less cause for optimism. The available information suggests 
that small employers offer less generous leave policies and benefits, and poorer 
working conditions in general, although they compensate for these deficiencies 
somewhat by offering a broader array of work hours and work schedules than 
do larger employers (Miller 1992, Wiatrowski 1994). 

Most human resource specialists agree that, to b2 effective, companies must 
combine various initiatives into comprehensive programs to meet the family 
needs of workers (Galinsky & Stein 1990, Smith 1992, Capowski 1996). Adop- 
tion of a single policy such as flextime is unlikely to yield great benefits to an 
organization in the absence of other types of orgarizational assistance. Raabe 
(1990) reports several studies that have shown earlier returns to work following 
childbirth among mothers when maternity leave is combined with some form 
of child-care assistance. In an effort to rate systematically the extent to which 
companies provided a comprehensive program of family responsive policies, 
Galinsky et al (Families and Work Institute 1991) developed a typology of 
stages, with stage 1 indicating no overall strategv and an emphasis on child 
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care, stage 2 indicating an integrated approach with several experimental ini- 
tiatives, and stage 3 representing integration of family responsive policy into 
overall business strategy and organizational culture. Using this typology with 
their survey of 188 large firms, they found that 79% of sampled firms had not 
moved beyond stage 1, with 33% falling below even stage 1 criteria. Using a 
comprehensive list of 22 possible types of family assistance among a regional 
sample of 178 companies with more than 50 employees, Seyler et al (1995) 
found that 80% had three or fewer types of assistance, with the modal category 
being only one type of assistance (30%). 

Researchers also agree that policies must be combined with supervisor sup- 
port and advocacy from upper management to close the gap between com- 
pany policy and practice (Smith 1992, Galinsky 1988, Raabe & Gessner 1988, 
Capowski 1996). Qualitative evidence has repeatedly revealed that employ- 
ees will not take advantage of family responsive policies, particularly leave, 
work reduction, and work schedule policies, if they feel that doing so will jeop- 
` ardize their job security, work assignments, or promotional possibilities. In 
some reported cases, employees had to receive supervisor permission before 
altering work schedules or taking leave, giving managers room to subvert for- 
mal company policy. For these reasons, consultants now routinely recommend 
management training as part of the implementation of work/family programs 
(Smith 1992). 


Determinants of Organizational Adoption 

of Family Responsive Policies 

Because the data on prevalence reveal great diversity in the extent to which 
employers have adopted family responsive policies, we turn now to the avail- 
able literature on the organizational innovation and diffusion of work/family 
programs. Some scholars have used a rational choice perspective in which 
efficiency considerations either compel or retard organizational solutions to 
work/family conflict (Glass & Fujimoto 1995, Osterman 1995, Seyler et al 
1995, Auerbach 1990). Others have explored the application of institutional 
theory to explain why employers implement work/family programs, focusing 
on isomorphism within organizational fields and the effects of professional per- 
sonnel administration on employee benefits {Osterman 1995, Goodstein 1994, 
Glass & Fujimoto 1995, Auerbach 1990). Both perspectives suggest that or- 
ganizations with predominantly female workforces and large firms in general 
should adopt policies earlier and more comprehensively than others, while insti- 
tutional theory predicts quicker adoption of work/family policies in firms with 
formal personnel administration and among firms with frequent interactions 
with other organizations that have implemented work/family policies. Both 
perspectives also emphasize the importance of search and replacement costs 
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in the instigation of efforts to reduce turnover—from an efficiency perspective 
this suggests that organizations with large training costs or greater reliance on 
skilled labor should more quickly unplement family responsive programs, while 
an institutional perspective emphasizes the role of well-developed internal la- 
bor markets and high-commitment work systems (-nvolving self-managed work 
teams and worker involvement in managerial decision-making) in the adoption 
of work/family programs. 

The literature on this topic is just beginning to substantiate certain empirical 
regularities in the organizational characteristics cf firms implementing family 
responsive policy. Using the total number of policiss adopted by an organization 
(a rather crude measure given the variation in cost and employer commitment 
required by different policies), both Seyler et al £1995) and Osterman (1995) 
found that female concentration within firms and firm size were important deter- 
minants of the number of policies offered, although Osterman’s results showed 
that the effect of female concentration disappeared after occupational controls 
were introduced. Using a categorization scheme that emphasized the compre- 
hensiveness of coverage as well as the number of policies offered, Goodstein 
(1994) also showed dominant effects of female concentration and firm size. 
However, Glass & Fujimoto (1995), using a sample of employed women, found 
that firm size was positively related only to formal benefits, while informal leave 
policies and schedule policies that depended on supervisor cooperation (work- 
ing at home or using sick leave for children’s illnesses) were more likely to 
occur in small firms. 

Regarding formal personnel management and the need to remain isomorphic 
with the organization's environment (key components of institutional theory), 
Osterman (1995) found that the presence of human resource departments in- 
creased the number of policies offered, while Goodstein (1994) found large 
effects of prior diffusion of work/family programs in the organization's indus- 
try group. But the conception of "institutional isomorphism" used to explore 
diffusion of family responsive policy may need to be broadened. Friedman 
(1990), Auerbach (1990), and others have noted that firms whose product mar- 
kets target families or have family-sensitive constituents (government or non- 
profit social welfare organizations) are often more likely to be committed to 
workplace family responsive policies in principle and to see the implementation 
of family responsive policy as good for improving their public image. 

The results of studies designed to test the effects of search and replacement 
costs on the adoption of work/family policies have been mixed. Seyler et al 
(1995) did not find effects of training costs or. the number of work/family 
policies offered, although they did find that larger recruiting budgets predicted 
greater financial aid policies for workers with dependent care responsibilities. 
Osterman (1995) found large effects of high-com mitment work systems on the 
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number of family responsive policies offered, as well as strong indications that 
firms relying on professional and technical workers were more likely to adopt 
policies, but did not find any effect of the presence of an internal labor market. 
Glass & Fujimoto (1995) also found that professional and managerial workers 
were more likely to be covered by formal leave policies and to be able to work 
regular hours at home. 

The empirical work in this area, as in others, would be greatly improved by a 
more uniform and adequate conceptualization of family responsive policy, one 
that accounts for both the different types of assistance offered and the inten- 
sity of employer commitment to policies (including training of managers and 
organizational support for employees using these benefits). Another difficulty 
plaguing investigation of organizational responses to family needs of employ- 
ees is the inadequate measurement of work/family policies—some studies look 
only at formally institutionalized policies while others include informal work 
arrangements negotiated by individual employees. Perhaps part of the strong 
empirical association between firm size and work/family policies comes from 
the failure of small firms to have formal personnel policies in general, although 
they may actually be more receptive to individually negotiated arrangements 
than are more bureaucratic workplaces. It is also possible that large firms 
by their size alone are more likely to have at least a few employees on infor- 
mally negotiated work schedules, and hence they are more likely to report the 
existence of family responsive policies when in fact those policies are only 
weakly institutionalized. 


EVALUATION OF FAMILY RESPONSIVE POLICIES 


Organizational Effects of Family Responsive Policies 

Evaluations of the effects of work/family initiatives focus primarily on em- 
ployee recruitment, turnover, absenteeism, and productivity. While the litera- 
ture suggests that organizations sometimes adopt family responsive policies in 
response to other concerns—the desire to avoid unionization, forestall govern- 
ment regulation, create a positive public image with important constituents, etc 
(Friedman 1990, Auerbach 1990) —the available literature centers on the cost 
effectiveness of various family responsive policies. Though some work/family 
policies are praised as productivity-enhancers and money-savers, the empirical 
evidence is rife with problems that impede firm conclusions. 

The main impediments to good evaluation research on the productivity ef- 
fects of family responsive policy have been summarized by Auerbach (1990: 
395) as: “a) defining and measuring productivity, b) imputing causality be- 
tween the [family responsive] benefit and outcomes, and c) determining whether 
impacts... are worth the program's cost to the employer" The few studies that 
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exist often use perceptual measures of work/familv policy outcomes rather than 
behavioral measures of these effects. The lack of longitudinal and compara- 
tive data poses another problem in assessing the organizational effectiveness 
of family responsive policies (Raabe 1990). Without such information, we 
really do not know how organizational outcomes differ given a different mix 
of work/family policies. Family responsive policies and their organizational 
effects will be evaluated in light of these methodclogical concerns. 


WORK HOURS Policies that address employees’ needs for reduced work hours 
have been linked to both increased organizational productivity and decreased 
turnover. In one study, 6296 of permanent part-time employees attributed in- 
creased productivity to part-time employment. Further, for workers in exper- 
imental part-time programs who were doing their previously full-time jobs in 
part-time hours, productivity appears to have bezn increased (Rogers 1992). 
However, because these studies rely on soft date in the form of self-reports, 
the link between reduced work hours and individual productivity is not well 
established. Findings relating part-time work to turnover suffer from similar 
problems. Rogers (1992) reported that nearly all of the 30 respondents in a 
part-time employment program said that, in the absence of the ability to re- 
duce their work hours, they would have left the lebor force. However, the link 
between reduced work hours and turnover was nct directly tested. 


SCHEDULE FLEXIBILITY Flextime is the most widespread and time-tested of 
work/family policies. In their review of research on the organizational and 
individual-level effectiveness of flextime programs, Christensen & Staines 
(1990) found that these policies decreased tardiness, absenteeism, and turnover. 
Another study utilizing an experimental design demonstrated significant re- 
ductions in absenteeism with the implementation of flexible scheduling in the 
experimental group, although this study failed tc find a link between flexible 
scheduling and turnover (Dalton & Mesch 1990). Further evidence of the effi- 
cacy of flexibility for organizations is offered by Rogers (1992). His research 
shows employees ranked the importance of the flexibility to balance work and 
family fourth out of sixteen factors in their decision to stay with the company. In 
the subsample of employees rated as high performers, flexibility ranked second 
only to compensation (Rogers 1992). Finally, other research has demonstrated 
a positive association between flextime policies and job satisfaction (Thomas 
& Ganster 1995, Christensen & Staines 1990). Given the documented link 
between job satisfaction and turnover (Mueller & Price 1990), policies that 
increase job satisfaction should be indirectly and negatively related to turnover. 


WORKPLACE SOCIAL SUPPORT Formal social support policies, particularly on- 
site or employer assistance with child care, have shown limited organizational 
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effects. In a study relying on supervisors’ perceptions of job performance and 
child-care related absences, no link between on-site child care and absenteeism 
was found (Kossek & Nichol 1992). However, on-site child care positively af- 
fected employees’ decisions to remain employed at the company and increased 
the extent to which they used the on-site child-care policy as a recruitment 
tool (Kossek & Nichol 1992). Youngblood & Chambers-Cook (1984) reported 
similar findings concerning on-site child care and turnover, and they also failed 
to find a link between on-site child care and absenteeism. Additionally, Goff 
et al (1990) failed to find support for the linkage between on-site child care 
and absenteeism. It appears that on-site child care may be more important 
for recruitment and retention than dealing with actual day-to-day productivity 
concerns such as absenteeism. If there is no provision for sick children at the 
on-site child-care center, parents still have few options for child care when their 
children are sick, and thus they may provide care themselves. 


Effects of Family Responsive Policies on Families 


Individual workers and their families also benefit from the implementation of 
work/family policies, although few studies directly assess parenting behaviors 
or child outcomes. In this section, we evaluate the effects of flextime policies, 
reduced work hours, and both formal and informal social support on parents 
and their families. 


REDUCED WORK HOURS Reduced work hours have been linked to increased 
mental health. Hyde et al (1995) showed that the combination of short ma- 
ternity leave and marital concerns increased depression in women, while the 
combination of short maternity leave and low levels of job rewards resulted 
in increased anger in women. Longer time off work postpartum, in combi- 
nation with other factors, served to decrease depression and anger. Similarly, 
full-time employment has been linked to increased anxiety levels in mothers. 
Homemakers and mothers working part-time displayed lower levels of anxiety 
than mothers who were employed full-time (Hyde et al 1995). 


SCHEDULE FLEXIBILITY Some research has failed to find a link between for- 
mal flextime policies and reduced work/family conflict (Shinn et al 1989), while 
other research shows associations between flextime and many individual and 
family outcomes. In a study of nurses (99% of whom were mothers), flex- 
time was linked to decreased work/family conflict, decreased depression, fewer 
somatic complaints, and lower blood cholesterol (Thomas & Ganster 1995). 
Other studies reveal that flextime policies can increase time spent with family 
(Winnet et al 1982) and time spent on family needs (Christensen & Staines 
1990). Flextime has also been linked to decreases in employee perceptions of 
work/family conflict, although it appears to have the least beneficial effects for 
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employed women with husband and children presert. Decreases in work/family 
conflict due to flextime policies are most enhanced [or those employees without 
primary child-care responsibilities (Christensen & Staines 1990). 


SOCIAL SUPPORT Finally, workplace social support also serves to enhance 
families’ well-being. Supportive supervisors have been linked to decreased 
symptoms of health problems in married men (Greenberger et al 1989), as 
well as decreased work/family conflict for parents of preschoolers (Warren & 
Johnson 1995, Goffetal 1990). However, in a sample of parents of preschoolers, 
on-site child care was unrelated to work/family corflict. On-site child care may 
not be as helpful to parents as diffuse policies such as flextime, reduced work 
hours, and supervisory support, which allow parents to take care of their family 
responsibilities on their own time. 


Summary 
Although well-designed evaluative studies are still lacking in many areas, some 
general conclusions can be drawn about the effects of work/family policies on 
organizational and family functioning. Decreased work hours serve business 
by increasing employee productivity and decreasing turnover, and they serve 
families by decreasing depression in employees. Flextime policies increase 
employee productivity by decreasing absenteeism and turnover, and they pos- 
itively influence family functioning by decreasing employee depression and 
work/family conflict while increasing the time families spend together. Im- 
portantly, most research conceptualizes schedule flexibility rather narrowly. 
This focus leaves unexamined policies that allow employees more autonomy 
in structuring their work hours. 

The effects of workplace social support are mixed. On-site child care appears 
to have little effect on either family or organizational functioning, although a 
link between employer-provided child care and decreased turnover has been 
demonstrated. That employer-provided child care does not affect absenteeism, 
depression, or work/family conflict is somewhat surprising. The clearest ad- 
vantage of on-site child care is the increase in the availability of child care; but 
the quality, hours of operation, and cost of on-sit2 child care may not be sub- 
stantially different than for off-site care. Further, even slots in on-site child-care 
centers may be limited, meaning spaces may not be available for all employees 
seeking care. These problems likely limit the effectiveness of on-site child 
care. However, evidence suggests that informal social support, in the form of 
sympathetic supervisors, does have positive effects on employees, decreasing 
symptoms of health problems and work/family conflict. 
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BARRIERS TO FURTHER INSTITUTIONALIZATION 
OF FAMILY RESPONSIVE POLICY 


Scholars are still debating whether organizational recruitment, retention, and 
productivity problems are sufficient in themselves to prompt widespread adop- 
tion of family responsive policies. Effective family responsive policies cost 
work organizations, particularly in the early stages of development and imple- 
mentation. In general, the economic motivations to institute family responsive 
policies remain unclear at best. While work/family programs have generated 
enormous interest within professional human resource management and are the 
subject of numerous publications and books, solid research evidence showing 
economic benefits to corporations instituting work/family programs has been 
difficult to document. Several scholars doubt that work/family programs can 
ever be viewed as economically preferable to the simple exclusion of work- 
ers with dependent care responsibilities (Hunt & Hunt 1982, Kingston 1990). 
Tactically, employers may find that statistical discrimination against workers 
planning or having child-care responsibilities (mostly young women) may be 
economically rational, particularly for positions involving employer-provided 
training or entailing large search and replacement costs. For other positions, 
employers may be willing to hire those with actual or expected caregiving 
needs because they choose to tolerate greater turnover and ensuing search and 
replacement costs rather than expend resources on work/family policies. 
While evidence suggests that employee demand for family responsive policy 
is increasing, it is nevertheless the case that most workers only need intensive 
employer accommodation to family needs for the short period in their life cycle 
when they have young children at home. Most employees may eventually need 
family responsive accommodations but not continuously over their life course, 
thus muting employee pressure on employers at any given time. Unions have 
expressed increased interest in negotiating family responsive policy as part of 
their interest in employment sectors predominantly employing women. How- 
ever, whether unions will be successful in achieving family responsive benefits 
remains unknown, particularly in this era of declining union membership. 
Given the current economic climate in the United States, workers may face 
even bigger obstacles to family responsive policy in the coming years. While 
both employers and employees want a "flexible workforce," that term means 
vastly different things to those two groups. Pressured by shareholders de- 
manding strong economic performance and international competitors seeking 
to undermine domestic producers, many companies are downsizing their labor 
force, extracting longer work hours from those that remain, and encourag- 
ing the use of temporary and part-time workers paid lower wages and few if 
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any benefits (Smith 1993, Schor 1992). To employers, this represents a new 
“flexible workforce.” In contrast, employees use the term to indicate greater 
willingness to prorate benefits for part-time workers or job-sharing partners, 
allow work at home, and encourage creative scheduling such as fewer work 
days or staggered work hours to accommodate family schedules. Clearly, the 
forces of globalization and domestic economic restructuring are encouraging 
employers to increase productivity and exert more control over labor costs, 
goals that are often at odds with workers’ attempts zo increase their family time 
and preserve their families’ well-being. 

There may be cause for at least cautious optimism. Some human resource 
specialists are promoting the use of family responsive work options instead of 
the existing corporate tools of downsizing and subcontracting to deal with the 
ebb and flow of labor demand. Offering part-time work with prorated benefits 
instead of early retirement or layoffs, and using job-sharing, voluntarily reduced 
work hours, and work-at-home arrangements to cut payroll and physical plant 
costs are ways that companies can offer true flexibility while still maintaining 
profitability under intense competitive pressures (Scott 1992, Capowski 1996). 
Yet this win-win scenario assumes that the timing of workers’ family needs 
will coincide with employers' needs to trim costs; moreover, it assumes that 
employers can cut costs and still meet caregivers' needs for adequate income 
(a questionable assumption given the costs of supporting dependents). 


FUTURE RESEARCH AND POLICY DIRECTIONS 


This essay has focused on the needs of parents and organizations for accom- 
modations to employees' family obligations, the prevalence and organizational 
diffusion of specific types of assistance, and the evaluation of those types of as- 
sistance on both organizational effectiveness and employee family functioning. 
However, the motivations of parents and employers are not the same, nor are 
employees and employers equally powerful in determining the initiatives that 
achieve widespread adoption. For example, some scholars have castigated the 
corporate consultants who promote work/family programs for their neglect of 
the most compelling needs expressed by employed parents—for higher wages 
and job security (Kingston 1990, Raabe 1990). A more radical view of fam- 
ily responsiveness would promote workplace policies that raise wages while 
avoiding both excessive work hours and frequent layoffs or downsizing. 

In contrast, employers' motivation is to ensure an adequate, trained supply 
of labor to the firm at the lowest possible cost. That translates into policies 
that help employees cope with their childrearing or elder care responsibilities 
without actually increasing the time or money employees have to fulfill those 
responsibilities themselves. Family responsive policies that free workers to care 
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for their own dependents cut into the unfettered supply of labor to the firm and 
thus are less likely to be institutionalized in the business community without 
considerable external pressure (Glass & Fujimoto 1995). While some classes 
of workers, particularly those with greater market power, may get concessions 
that limit their employer's ability to schedule work at will, such concessions 
are unlikely to be extended to the labor force as a whole. 

Some examples of the most popular work/family initiatives succinctly il- 
lustrate this problem. Flextime has become a commonplace, low-cost policy 
among employers. Yet flextime typically requires work throughout the “core” 
hours of 10 am to 3 pm. Moreover, most employees must obtain supervisor 
consent before they change their specific start and end times of work. These 
restrictions prevent the kind of flexibility parents require to attend school events, 
consult with teachers or health care workers, and stagger work hours with a co- 
operating spouse to increase parental time with children. The really flexible 
scheduling options for parents—work at home, compressed work-weeks, free- 
dom to schedule particular days or hours at work, freedom to leave work for 
emergencies, etc—are far more rare. Work reduction options, such as job- 
sharing, and reduced-hour work weeks are even more difficult to find (Russell 
1988). 

The most popular child-care initiatives, such as information and referral 
services and flexible spending accounts, are equally problematic. Information 
and referral services often only provide employees with lists of licensed or 
registered care providers in the local community. In themselves, these services 
do nothing to improve the supply, quality, or cost of child care in the community. 
Just as importantly, they provide no guarantees to parents that the listed child- 
care providers are adequately trained, have spaces available, or will stay in 
business. Flexible spending accounts permit employees to use pre-tax earnings 
to pay for dependent care, substantially lowering the cost of care for earners in 
higher earnings brackets. However, tax codes already enable most employed 
parents to save the same amount through the child and dependent care tax 
credit. Moreover, low-income workers who owe no federal income tax obtain 
no benefit at all from these spending accounts. 

The available evidence suggests collective failure to provide policies to those 
most in need—low income workers, young and/or single parents at the early 
stages of their work careers. A more realistic assumption may be that federal 
intervention will be required to level the playing field for all employers and 
employees with caregiving responsibilities, in much the same way the mini- 
mum wage ensures that labor exploitation does not allow some employers to 
prosper at the expense of family and community well-being. This has been the 
path followed by the welfare states of Europe, with accompanying increases in 
taxation in return (Kamerman & Kahn 1987). However, the United States is 
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moving toward decreased federal and increased state responsibility for social 
welfare. Without federal minimum safeguards, it seems unlikely that caregivers 
will be able to advance 1n the labor market as well as those without caregiving 
obligations, exacerbating existing problems with child welfare (Spilerman & 
Schrank 1991). 

To develop any type of political consensus on how public policy should assist 
employed parents, more and better research is needed on the consequences of 
specific work/family initiatives. In particular, the lack of research directly link- 
ing employer policies to parental functioning and child well-being represents 
a formidable obstacle to policy-making. Moreover, the existing research on 
costs and benefits to employers needs to be supplemented with research using 
more rigorous and clearly defined measures of organizational outcomes, as well 
as more representative samples of employers (including small firms). Finally, 
we advocate closer attention to the measurement of family responsiveness it- 
self, focusing on three central dimensions: the type of policy (hours reduction, 
schedule flexibility, or social support), the intensity of the policy (the degree 
of employer commitment to the policy objective), and the formalization of the 
policy (the extent to which the policy is available to all employees). With better 
measurement, the comparative impact of leaves and hours reduction, schedule 
flexibility, and social support on the functioning of families and organizations 
can be assessed, as well as the effects of different policy combinations. This 
remains the research agenda for the future. 
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ABSTRACT 

A growing body of social science literature has examined the organizational inno- 
vations and staffing practices comprising new flexible forms of work. Researchers 
have investigated the depth and scope of these changes and questioned how they 
affect diverse groups of workers in the United States. Reviewing the research on 
this transformation reveals a model of combined and uneven flexibility, charac- 
terized by the opening of opportunities that are differentially distributed across 
different groups of Àmerican workers, emerging under conditions in which effort 
is intensified, control is decentered, and employment is destabilized. The essay 
concludes by suggesting additional areas of inquiry for sociologists concerned 
with new forms of work organization. 


INTRODUCTION 


Simultaneous, mutually conditioning changes in global economic conditions 
and in organizational and employment innovations are dramatically reconfigur- 
ing a spectrum of work settings in the postindustrial United States (Aronowitz 
& DiFazio 1994, Block 1990, Harrison 1994, Reich 1991, Sabel 1991, Sayer 
& Walker 1992, Zuboff 1988). Recent social science literature on work, or- 
ganizational change, technology, and industrial relations has focused on new 
flexible work systems (Osterman 1994) that have surfaced from the currents of 
those changes, and researchers have predictably debated their consequences. 
In the more idealized accounts of work flexibility, newly skilled, continually 
learning, empowered and engaged workers, aided by entrepreneurial managers, 
strive to relax and flatten rigid bureaucracies, trim excessive use of organiza- 
tional resources (including time, space, and people), and use their experiential 
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knowledge to improve the way they produce goods cr serve people (Heydebrand 
1989, Kanter 1989). This revolution in work and organizational practice, it is 
argued, should align workers’ interests with those of their managers, enhance 
productivity, innovativeness, and quality, and ther2by enable American firms 
to regain the competitiveness and profitability that they took for granted in the 
postwar era (Dep. of Labor Fact Finding Report 1994, Nadler & Lawler 1983). 

Critics of flexible systems, in contrast, see them as little more than a new 
permutation of work that disadvantages workers but offers employers signif- 
icant dividends. Flexibility, in this view, may cal for new production tech- 
niques or novel work group formations but doesn't entail a substantive break 
with traditional hierarchical modes of control and authority relations; rather, it 
embodies and even deepens them by obscuring power behind participatory lan- 
guage (Pollert 1988). Some have feared as well that flexible work systems, and 
the collaborative work relations on which such systems are based, will erode 
the power of the organized labor movement (Fantasia et al 1988), while others 
simply doubt that the corporate rhetoric about flexibility has materialized in 
real changes in the American workplace (Gordon :996). 

This essay strives to unpack and move beyond the polarized conceptions 
of flexibility by reviewing research on new forms of work organization and 
specifying their outcomes for different groups of workers. I first summarize 
the global context that has led many employers and managers to endorse the 
flexible model of work, normatively if not always in practice (Appelbaum & 
Batt 1994), and then I map out two interrelated, frequently overlapping elements 
of flexibility found in a variety of occupational and industrial settings. 

The first element corresponds to what has been termed "functional" flexi- 
bility (Wood 1989) and includes organizational mechanisms and work flow in- 
novations that "build in" employee involvement: new technologies, inventory 
methods, job enlargement schemes, self-managed teams, and quality circles. 
This set of innovations is premised on securing the deeper engagement of core 
workers, on continually training them, and on exploiting their accumulated 
knowledge and experience. While these innovations directly reshape the work 
of blue- and white-collar nonmanagerial workers, they indirectly reshape the 
function of supervision and management as well. 

The second corresponds to what has been termed "numerical" flexibility 
(Wood 1989) and refers to the ascent of contingent jobs and workers and to the 
decline of the permanent employment model, with the latter directly affecting 
professionals and managers. Research on these two sets of changes points 
to a model of combined and uneven flexibility, characterized by the opening 
of opportunities that are differentially distributed across different groups of 
American workers, emerging under conditions in which effort is intensified, 
control is decentered, and employment is destabilized. The essay concludes by 
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suggesting research directions for advancing the sociological understanding of 
changes in the contemporary workplace. 


THE GLOBAL CONTEXT: WHAT’S CHANGED? 


The debate about flexible work systems has its origin in a series of deep-seated 
changes in manufacturing and production industries. In the postwar years, 
American mass-production-based industries dominated global product markets. 
Labor and management in core-sector firms forged an accord in this period that 
gave employers the right to make all decisions about production processes but 
in turn entitled blue- and white-collar workers, nearly all male, to a living 
wage and secure employment. Central to the postwar accord was a production 
infrastructure premised on hierarchy, standardization, and routinization that 
excluded workers from decision-making and authority but enabled productivity 
to rise and profits to soar (Womack et al 1990). This "Fordist" mass-production 
system (Harvey 1989), well-suited to an economic framework geared for stable, 
continuous output of huge product mins, was an “engine of growth” of the 
postwar period (Piore & Sabel 1984:183), and, in conjunction with favorable 
international conditions, enabled the United States to remain firmly in the lead 
for several decades. 

Although ensuring financial success for American companies, this produc- 
tion system, requiring workers’ labor power but formally sidelining their brain 
power and their initiative, incurred significant psychological and emotional 
costs to both blue- and white-collar workers: high dissatisfaction, boredom, 
alienation, and low self-esteem (Aronowitz 1973, Work in America 1973). Fur- 
thermore, the accord excluded a significant number of Americans who lacked 
access to the secure, “good” jobs of the postwar era. Instability, unpredictabil- 
ity, and low compensation were the hallmarks of employment for white women 
who worked in the labor force and for women and men of nonwhite racial 
and ethnic groups, who were confined largely to the secondary labor market 
(Gordon et al 1982) or who were otherwise underutilized (Sullivan 1978). 

In the 1970s, American corporate profitability faced serious challenges, and 
the US economy began to lose its premier position in the international econ- 
omy (Bluestone & Harrison 1982, Bowles et al 1984, Obey & Sarbanes 1986). 
Developing countries gained the capacity to mass produce goods more competi- 
tively. Thus, finding ways of changing product lines swiftly became imperative 
for US firms. Japan’s economic success inspired widespread debate about 
the feasibility of and limitations to importing, into the American context, the 
Japanese lean production techniques (Abo 1994, Cole 1989, Kenney & Florida 
1993, Milkman 1991, Wood 1989). As other economies began to compete with 
and overtake the United States in numerous markets, and as the United States 
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struggled to maintain industrial competitiveness, researchers and management 
practitioners began to scapegoat the very business practices once considered 
to be the engine of growth, identifying them instead as accelerating America’s 
decline. 

In response to such national and international transformations, social sci- 
entists turned their attention to new forms of work that might undo the worst 
effects of the mass production orientation. Early on, theorists of flexible spe- 
cialization (Piore & Sabel 1984) argued that work systems that tapped craft 
knowledge, increased participation, and provided continual training would en- 
able workers on the shop floor to specialize flexibly and to vary their product 
lines rapidly and with high quality. Others focused on new manufacturing sys- 
tems (Hirschhorn 1984) powered by advances in cybernetic technology that 
were driving, if not requiring, employers to adopt more flexible ways of or- 
ganizing workers. Manufacturing work settings defined by standardization, 
hierarchy, constraint, and the curtailment of workers’ input, would give way to 
new sociotechnical systems in which workers would be empowered to become 
pattern-finders and problem-solvers, to learn from their errors, and to deploy 
and redeploy machines for a multiplicity of purposes. Fluidity, a climate for 
ongoing learning, and meaningful participation could be imported into the cor- 
porate, white-collar world of work as well, providing a foundation for a more 
integrated, nonhierarchical, and innovative work environment (Kanter 1989). 

The new emphasis on participation and flexibility was not without its detrac- 
tors: Although American companies, throughout the 1980s, experimented ex- 
tensively with small-group activities, work humanization, and job enrichment 
programs, critics of the quality and participative movement derided these as 
faddish, predicted early death, and doubted that the swell of interest in worker 
participation would be matched by a deep-seated commitment to genuinely 
empowering workers (see Cole 1989, Cole 1995). 

Much of the research reviewed in this essay has explored and tested the 
expectations of this first wave of literature. A core expectation was that flexible 
work systems could be devised that not only would draw on workers’ knowledge 
and judgment, but would privilege their knowledge above that of supervisors 
and managers by engaging them in design, planning, and identifying and solving 
problems. Combating the stultifying, alienating effects of mass production and 
scientific management, this innovation would crea:e a basis for increasing the 
commitments of workers and lead to the convergence of their organizational 
interests with those of management. 

To achieve this, flexible work arrangements wold decentralize structurally, 
leaving production and service workers in comparatively autonomous positions. 
Flexible systems would institutionalize involvement, creating opportunities in 
which workers could interpret information, act on their experiential knowl- 
edge, make decisions in a timely way, spontaneously innovate in accordance 
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with shifting product and service demands, and maximize their efforts by col- 
laborating with others in offices and on shop floors. Specific organizational 
innovations most likely to enable workers to carry out these goals included 
quality circles, employee involvement programs, job enlargement and rota- 
tion, self-managing teams, continuous improvement processes, organizational 
decentralization, and the promulgation of a new ethos of participation. Just-in- 
time inventory procedures, outsourcing, and techniques for fine-tuning the size 
of the work force would institutionalize flexibility more firmly, enabling firms 
to move swiftly, to cut through excess and minimize waste. 

As we near the end of the twentieth century, preoccupation with the possible 
benefits of flexibility has diffused throughout the American work and occupa- 
tional system. Although the precise extent of the implementation of flexible 
practices is difficult to measure, there is little doubt that the new model is per- 
vasive, even if unevenly developed across occupational, organizational, and 
industrial settings. Drawing on a survey of a random sample of US manufac- 
turing establishments, Osterman (1994) estimated that about 3596 of private 
sector firms with 50 or more employees had made substantial use of flexible 
work organization (that is, firms had implemented two or more flexible work 
innovations). Other studies (Appelbaum & Batt 1994:60) suggest that the pro- 
portion of Fortune 1000 firms with at least one employee-involvement practice 
stood at about 8596 in 1990. 

Unionized workers, although often reluctant participants, have helped craft a 
cooperative, high-trust industrial relations framework to enable American firms 
to adapt and change more rapidly (Bluestone & Bluestone 1992, Cornfield 1987, 
Dep. of Labor Fact Finding Report 1994, Heckscher 1988, Kochan et al 1986, 
Kochan & Osterman 1994). Assumptions about the benefits of participation 
have extended into nonunion manufacturing workplaces (Graham 1995) and 
service settings (Appelbaum & Batt 1994: Ch. 7, Smith 1990, Smith 1994), 
although research on flexibility in service work lags far behind research on 
manufacturing and production settings. And those writing popular manage- 
ment and organizational literature have circulated the faith in the new model of 
Work by criticizing bureaucracy and by praising individual flexibility, empower- 
ment, and organizational change (Davidow & Malone 1992, Peters & Waterman 
1982). In other words, in response to the competitive turmoil of recent years, 
a new model of flexible work has seeped into corporate rhetoric and employer 
strategy. The social science literature on this seepage illuminates various, often 
contradictory, dimensions and consequences of the flexible model. 


INVOLVED WORKERS, FLEXIBLE PRODUCTION 


Under the broader rubric of engaging production and service workers, organiz- 
ing them to bring their hands-on knowledge and experientially-based judgment 
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to bear on the way they do their work, enabling them to act in unpredictable 
and changing circumstances without having to seek formal authorization from 
managers, and giving them a stake in the outcomes of their actions, many work 
organizations have “built in" a number of interrelaied organizational innova- 
tions. Researchers from the fields of management, organizational behavior, and 
economics have argued that these structural and proc2ssual innovations can lead 
to increased organizational productivity (Cooke 1989, Margulies & Black 1987, 
Rosenberg & Rosenstein 1980) and reduction of abs2nteeism, minor accidents, 
grievances, and quits (Havlovic 1991), but others caution against a simple causal 
relation between participation schemes and productivity increases (Levitan & 
Werneke 1985). The US Department of Labor’s Dunlap Commission reports 
that efforts to establish positive correlations between participative innovations, 
on the one hand, and productivity, efficiency, and organizational effectiveness, 
on the other, have shown “mixed results” at best (Dep. of Labor Fact Finding 
Report 1994:45). 

The studies reviewed in this section do not attempt to measure these par- 
ticular administrative outcomes; they focus instead on issues such as control 
and consent, autonomy, hierarchy, and skills that are more central to the soci- 
ology of work, occupations, and organizations. This section reviews literature 
on two important aspects of organizational redesign: structural/labor process 
innovations and social relational change. Combined, they are constitutive of a 
new sociotechnical system predicated on limited flexibility and participation. 
Findings about organizational redesign do not support the claim for genuine de- 
centralization and empowerment that has been so pervasive in the call for new 
ways of working, nor do they support the claim that there is nothing new about 
the flexible model. Flexible practices can amplifv workers' responsibilities 
and degree of involvement, and in many cases provide openings to learn new 
skills and competencies. But these higher levels of participation are structured, 
constrained, and controlled in less visible although no less powerful ways, and 
workers can be held responsible for outcomes that wzre once the near-exclusive 
responsibility of supervisors and managers. 


Structural, Organizational, and Technological 

Reorganization 

JUST-IN-TIME INVENTORY PRACTICES The goal of jast-in-time inventory (JIT) 
systems in manufacturing is to achieve flexibility by minimizing parts, materi- 
als, and resources on hand, cutting back and eliminating waste, and improving 
product quality. In the “work-in-progress” (Adler 1992:135) or "just-in-case" 
(Kenney & Florida 1993:168) approaches to inventory accumulation typical of 
mass-production settings, companies maintain large stores of supplies, kept in 
accordance with predictions for long, stable produc: runs. JIT, modeled on the 
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Japanese kanban system, relies on outside suppliers to provide parts only and 
exactly when needed, in response to shifts in product demand or product spec- 
ification (Piore & Sabel 1984). Production ideally is pared down to its bare 
bones so as to permit workers to shift quickly into new, specialized product 
batches and to identify defective parts or products before too many of them 
reach the end of the production line. JIT incorporates a participative model 
because it requires greater worker attentiveness to the timing of the production 
cycle, to planning and matching inventory to manufacturing need, and to the 
quality of their products. Factory workers often help develop new production 
lay-outs to facilitate the JIT system, learn the statistical control tools they need 
to track production, and become experts in problem- and defect-identification 
(Dawson & Webb 1989), potentially transferable skill sets of some value in a 
postindustrial economy. 

Just-in-time systems also create a new set of pressures for workers, including 
intensified involvement, heightened accountability, and the need for vigilant 
watchfulness. JIT speeds up production processes, eliminates any idle time, 
yet makes the coordination of production more tenuous, with the sum effect of 
adding to workers’ stress levels (Gottfried & Graham 1993, Kenney & Florida 
1993, Sayer & Walker 1992). In the course of working on the line in a Subaru- 
Isuzu assembly plant in the midwest, Graham (1995) found that workers were 
often left waiting until the last minute for crucial inventory to be delivered 
by outside suppliers, an observation confirmed in Adler's (1992:136) study of 
the NUMMI auto plant. Workers both fell behind while waiting for critical 
components and ran to keep up when they had to install parts that either arrived 
late or required altering because they were of inferior quality or did not meet 
precise product specifications (Graham 1995:114). 

JIT heightens workers’ responsibility while heightening their visibility to 
management surveillance (Sewell & Wilkinson 1992, Shaiken et al 1986, 
Skorstad 1991). The system reduces and even eliminates temporal and spa- 
tial buffers that were once crucial for workers in maintaining some degree of 
autonomy from management. Workers have little opportunity to step back; they 
are also more tightly coupled to individuals on stations on either side of them, 
usually through mandatory collaboration. 

The JIT system additionally can lead to a reduction of total labor requirements 
in the firm (Dawson & Webb 1989), a reduction of production workers’ skills 
(Milkman & Pullman 1991), and increased difficulty and complexity in work 
scheduling (Brown & Mitchell 1991). Even ardent proponents of JIT acknowl- 
edge that it entails a “new, faster-paced, more intensive labor process” which 
fundamentally “pumps work out of workers” (Kenney & Florida 1993:264). 
Importantly, extra and intensified work may not be matched by additional com- 
pensation. In a service-delivery organization with a just-in-time production 
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process, workers who were in the role of “team leaders” and were assigned the 
task of overseeing production in an organizationally decentralized work site, 
received no extra pay and were not considered to 5e formal supervisors (Smith 
1994), a finding echoed in other studies. 


JOB EXPANSION AND ROTATION At the heart of the flexible model is a criticism 
of a traditional division of labor characterized by numerous job classifications, 
narrowly defined and demarcated by rule, and highly fragmented. Narrowly 
classified jobs constrain managers’ ability to redeploy labor in step with fluc- 
tuations in demand cycles and have been blamed for work organizations' in- 
efficiency and for their inability to promote continual learning processes, to 
innovate and to adapt. The new model of work calls for the expansion and 
enrichment of jobs, for the blurring of occupational distinctions, and for job 
rotation. Illustrating the extent to which companies go to jettison cumber- 
some job pyramids in their search for flexibility, Adler (1992) found that an 
auto manufacturing plant in northern California reduced the number of skilled 
trades classifications from 18 to 2 and had only one classification for another 
category of personnel where previously there hac been 80. 

Reducing the number of job classifications and expanding remaining jobs can 
impart to workers new skill sets and new domains of responsibility. Yet many 
studies indicate that including a greater variety of tasks has often intensified 
the demands on individual workers, increasing the scope of their efforts and 
responsibilities without necessarily giving them new skills or upgrading their 
position in the work hierarchy. Job intensification is often confused with job 
enrichment and expansion: workers are asked to do more with fewer resources, 
although they do not necessarily acquire new skill areas, new decision-making 
powers, or higher organizational status or pay (O'Reilly 1994, Shaiken et al 
1986). Job enrichment programs such as cross-training for different positions 
have been implemented in some cases only once the jobs themselves have been 
deskilled by technological innovation (Taplin 1995:430). 


TECHNOLOGY  JIT systems reorganize the work flow and inventory method 
while job expansion schemes reorganize the division of labor. Employers also 
implement new technologies that are designed to do both and to build in the 
capacity for participation and flexibility. "Stop-the-line" assembly production, 
a technological innovation frequently implemented in tandem with JIT, under- 
scores job intensification. The assembly line system of mass production is well 
known for the way that it dictates to the worker the precise movements he or 
she will make and paces those movements, thereby removing all necessity for 
independent judgment or intervention. Under the stop-the-line principle, in 
contrast, workers use their discretion to pull a cord, push a button, or otherwise 
halt the flow of the line when an individual spots a defective part or evidence 


NEW FORMS OF WORK ORGANIZATION 323 


of shoddy assembly. The stop-the-line technique is often thought to democ- 
ratize work (Womack et al 1990) because workers do not have to seek the 
approval of managers to halt the production flow, but in addition it speeds up 
the production process and compels workers to “work frantically to remedy pro- 
duction or quality problems” (Kenney & Florida 1993:265; see also Graham 
1995:113). 

Programmable technology poses a potential for genuinely decentralized shop- 
floor control for workers, if they are trained and empowered to change product 
design and to retool equipment. Orgamizational case study researchers have 
found that, while programmable technology can increase flexibility for the 
work organization, it may not necessarily increase the flexibility of workers’ 
jobs. Although touted as facilitating decentralization, planning and design can 
remain in the hands of specialized programmers working under the direction 
of management rather than concentrated in workers’ hands on the shop floor 
(Shaiken et al 1986). 

Vallas & Beck’s (1996) research on programmable control systems and con- 
tinuous process technologies in manufacturing plants in the pulp and paper 
industry reinforces these findings. They found that in the absence of other 
innovations, technology introduced to bring plants into the new era of flex- 
ible production did not reconfigure and did indeed strengthen the traditional 
bureaucratic hierarchical relations of the plant. Furthermore, the ascendance 
of a technological paradigm in this manufacturing setting, which emphasized 
cutting-edge statistical knowledge and feedback but denigrated craft knowledge 
acquired over time and through hands-on experience, reinforced the supremacy 
of young, educated, cosmopolitan engineers over long-employed manual oper- 
ators, generating new forms of inequality between the two groups. Formerly 
craft-based manual workers were trained to familiarize themselves with a wide 
range of data generated by computerized process controls, but they had little 
discretion in deploying those data. 

Flexible technology such as computer-aided design and numerically con- 
trolled cutting systems is not inconsistent with and may even intensify the struc- 
ture of "rigid jobs"—.characterized by low trust, status, skill, and pay—typical 
of the traditional mass production system (Taplin 1995). Computerized systems 
can deskill jobs; they more subtly control their work force, as management uses 
it to monitor the location of products, provide detailed information about work 
performance, and build in quality control mechanisms (Taplin 1995:429). 


Social Relational Change: Teams and Self-Managed 

Work Groups 

Integral to structural innovations has been the effort to reorganize interpersonal 
interactions, to structure personal involvement and collaboration between non- 
managerial workers. Introduced under the rubric of employee participation 
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and empowerment, this reorientation of social relations cannot be extricated 
from the structural changes identified in the preceding section and indeed is 
woven throughout them. Workers come together -o identify and solve prob- 
lems, improve quality and efficiency, and innovate production processes in 
teams or self-managed work groups and consultative groups such as quality 
circles. Ideally, heightened engagement in work through greater engagement 
with coworkers can also give workers a greater stake in the profitability and 
competitiveness of the firm as a whole. 


QUALITY CIRCLES Quality circles (QCs) are created on an intermittent basis 
to address immediate problems in the work process and to generate ideas for 
reducing costs, saving money, or improving other work outcomes. Often initi- 
ated by managers, they can include workers from different functional areas and 
hierarchical locations. Researchers find that, numerically, QCs are the most 
widespread [with some estimating that over 90% of Fortune 500 companies 
use them (Lawler & Mohrman 1985)], but organizationally they are the most 
shallow of the social-relational innovations. Many studies conclude that QCs 
are management-initiated practices that coopt or depoliticize workers (Grenier 
1988), a panacea for a variety of organizational problems (Shea 1986), or a 
way to get employees more involved without conierring real power (Rafaeli 
1985). Notably, however, QCs also are documented as increasing job satisfac- 
tion (Lincoln 1989). 


TEAMS AND WORK GROUPS The spread of the team structure has had more 
limited application than QCs yet has greater poteatial for challenging tradi- 
tional hierarchical modes of control and authontv. Self-managed teams or 
work groups aren't simply organized ad hoc to discuss production processes; 
team organization is built into the labor process itself as small groups produce 
together, make decisions, rotate their positions, maintain machinery, and direct 
and pace their progress. Benefits to workers can include the opportunity to 
become multiskilled, the reduction of managerial supervision, increased au- 
tonomy and acquisition of new skills, and the growth of workers' pride 1n the 
goods and services they produce (Aquilano 1977, Safizadeh 1991). Theoreti- 
cally the spread of teams suggests a deep-seated reworking of the division of 
labor, although the literature suggests a range of e-fects that fall short of that 
potential. 

Individual team members can be cross-trained to perform different tasks 
within the team's domain, but the tasks can be repetitive and standardized 
(Taplin 1995:427), suggesting that team organization merely multiplies the 
number of deskilled areas for which any given worker is responsible. Remu- 
neration schemes that are team- or group-based depart from fiercely protected, 
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individualistic performance norms, leading workers to resist pooling their ef- 
forts with others (Kanter 1990, Taplin 1995). Managements’ commitments to 
teams may be largely ceremonial, a nod paid to the normative model of flex- 
ibility, which obscures the reality that teams have little power (Graham 1995) 
or are organized top-down by managers (Vallas & Beck 1996). When a team- 
based, flattened organizational model of primary nursing was introduced in a 
large hospital, nurses experienced heightened job autonomy and reunification 
of tasks, but simultaneously they found that their jobs became greatly intensified 
and that they came under greater scrutiny by managers (Brannon 1994). 

Studies of self-managed teams present one conclusion consistently: Team- 
based production methods represent a new, more decentered, and less visible 
tactic of control. Monitoring, evaluation, and disciplinary action moves down 
the hierarchy from the hands of supervisors and diffuses into the hands of team- 
mates (Sewell & Wilkinson 1992, Shaiken et al 1986). Team-based systems 
of control have been termed a “tyranny” (Sinclair 1992), “concertive” (Barker 
1993), “devolutionist’ (Sewell & Wilkinson 1992), and “unobtrusive” (Prechel 
1994)—multiple conceptualizations that have several common ingredients. In- 
dividual participation is held in line by intensified monitoring of the self, con- 
tinual supervision and disciplining by work peers, shifting of responsibility 
for production goals and even for the company’s survival onto the shoulders 
of workers, and the concealment of managerial power from team participants. 
In these ways, power potentially resides in all locations, emerging at different 
times and being exercised by different actors: supervisors, coworkers, and even 
customers. 

Decentered control of this sort is reinforced by other strategies, off-center, 
off-site, and all pervasive. In sales and service settings in which employers and 
managers strive to maximize flexibility and competitiveness, architectural and 
spatial arrangements allow customers to watch every aspect of workers’ move- 
ments and behavior, leaving nothing invisible (Walsh 1993). Customer surveys 
evaluate, monitor, and discipline workers, thus extending the nexus of control 
into all sets of interactions in the workplace; and "shoppers," individuals paid to 
use the firm's services anonymously, evaluate the performance of workers with- 
out the knowledge of the latter (Fuller & Smith 1991, Leidner 1993). Cultural 
strategies of control in professional, white-collar settings similarly decenter 
power by locating it in the fabric of everyday life. Cultural programs are de- 
ployed strategically to amplify normative commitments to constant change and 
pressure, to encourage self-monitoring and heightened involvement in shap- 
ing how corporate goals are achieved, and to position workers to continually 
self-reflect, improve, and innovate (Kunda 1992, Smith 1990). 

Research on the structural and social-relational changes initiated with the 
goal of securing workers’ involvement, to make workers, and thereby work 
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organizations, more flexible and competitive, suggests that such changes insti- 
tutionalize a basis both for the acquisition of new skills and competencies and 
for securing intensified effort within a context of decentered but hierarchical 
control. These outcomes are obviously ambiguous for US workers. The deeper 
personal and interpersonal engagement in the planaing and execution of work, 
even when imposed from above, can afford numerous opportunities for indi- 
vidual and collective protection of interests and for greater self-determination 
on the shop and office floor. 


NONINVOLVED WORKERS, FLEXIBLE FIRMS 


Employers have also achieved organizational flexibility by expanding and con- 
tracting the size of their work forces to accommodate fluctuations in production 
and service cycles. One of the most important employment trends in recent 
years has been the replacement of permanent by temporary and part-time jobs 
in American firms and an increase in the numbers of people who are hired and 
let go on a contingent (Callaghan & Hartmann 1991, Doeringer et al 1991) 
or externalized (Lozano 1989, Pfeffer & Baron 1988) basis. This side of the 
flexibility coin runs counter to the organizational redesign discussed in the 
previous section; indeed, participation and involvement is targeted for core, 
permanent workers, who are in turn buffered from market fluctuations by new 
hiring practices that "dispose of" (Geary 1992, Thomas 1994) or "throw away" 
(Graham 1995) marginalized workers. The peripheral or contingent work force 
consists of temporary, part-time, seasonal, and subcontracted workers (Belous 
1989, Gottfried 1991, Henson 1996, Parker 1994, Uzzi & Barsness 1995). Pre- 
cise measurement of its size is difficult because of definitional ambiguities and 
because researchers may count the number of jobs rather than the number of 
job holders in various categories (Tilly 1991), but researchers estimate that it 
includes from one quarter to one third of the American work force. 

The research on flexible staffing practices reveals where men and women of 
diverse race and ethnic groups and white women fall in the flexibility equation. 
À little over one quarter of women (in contrast to a little over one tenth of men) 
who work for pay work part-time (Williams 1995), and two of three temporary 
workers are women; blacks are also overrepresented in the temporary work 
force at 2096 of the total (Belous 1989, Callaghan & Hartmann 1991). 

Researchers identify a number of factors governing the decision to use con- 
tingent workers to fine-tune staffing levels. Employers able to draw on a force 
of contingent workers avoid having workers on their payrolls when demand is 
slack; they reduce their wage and benefits costs v/hen they have temporary or 
part-time workers on board and save by externalizing the administrative costs 
entailed with recruitment, hiring, and control of temporary workers [Pfeffer & 
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Baron 1988, Smith 1994; see also Harrison & Kelley (1993) on different dimen- 
sions of flexibility that corporations attempt to gain by outsourcing, Davis-Blake 
& Uzzi (1993) on organization-level, and Cassirer (1995) on occupational- and 
industrial-level variables that have positive effects on the use of temporary and 
part-time workers and independent contractors, and Hossfeld (1995) on em- 
ployers’ use of temporaries to forestall their permanent workers from joining 
unions]. 

The growth of the contingent labor force 1s associated with increased “down- 
sizing" in American firms. Companies reduce the overall size of their perma- 
nent work forces and take on temporaries only as needed; and companies cut 
functional task areas from their business and subcontract the services of other 
firms to fill in this gap (Harrison 1994, Smith 1994). Of companies that re- 
ported job cuts between 1990 and 1995 [included in the American Management 
Association (1995:7) survey on downsizing, job elimination, and job creation], 
59% reported the subsequent increased use of temporary workers to replace 
those terminated. 

Employer rationale for using temporary workers has changed qualitatively in 
recent years. Whereas temporary workers have long been used to fill in for sick 
or vacationing workers, only recently have employers been hiring temporaries 
in massive numbers to fill formerly permanent positions. Temporary work now 
constitutes one of the fastest growing segments of the labor force (Plunkert & 
Hayghe 1995). The temporary help services industry has been a significant 
source of job growth in the 1990s (Gardner et al 1994) and will continue to lead 
job growth into the twenty-first century (Franklin 1995). 

It is tempting but misleading to group together temporary, subcontracted, 
and part-time work into a homogeneous category of contingent workers. For 
example, part-timers are not externalized workers, as are temporaries and sub- 
contracted workers (Reskin 1996). Further, there is only weak evidence for 
the claim that the staffing flexibility achieved by using part-time workers is 
new. Employers have long structured many part-time positions primarily as 
a solution to meeting fluctuating demand. Using part-timers allowed employ- 
ers in retail sales firms and food service establishments, for example, to bring 
on extra staff, primarily women with children who desired flexible schedules, 
during peak periods of business. What does appear to be new is the degree 
to which employers are using part-timers explicitly to hold down or cut wage 
costs, reflected in the fact that the part-time work force has grown gradually, 
but that almost all of that growth has taken place in the ranks of involuntarily 
employed part-timers (Jacobs & Qian 1996, in press; Tilly 1991, Tilly 1992). 
Employers now create part-time jobs and convert full-time jobs to part-time, 
primarily as a strategy to cut labor and benefits costs. They use part-timers in 
jobs that aren’t necessarily governed by daily ebbs and flows 1n the workload, 
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such as data entry and claims processing positions in the insurance industry 
(Tilly 1991:16). 

This trend has had implications for the gender composition of part-time 
work as well: As employers have structured more jcbs on a part-time basis and 
fewer as full-time, the proportion of people working part-time involuntarily has 
risen, and the number of men working involuntarily in part-time positions has 
risen as well because they have been unable to obtain full-time employment 
(Tilly 1991). In large part because temporary work has held a clearer position 
as a strategic pillar of flexible staffing practices, researchers have disproportion- 
ately focused on temporary jobs and the temporary help service industry. More 
specifically, they have focused disproportionately on lower-paid, less-skilled 
temporary workers rather than highly paid independent contractors. 

Early accounts, largely descriptive, of the growth of the contingent labor 
force and its role in creating flexibility for work organizations conveyed a sense 
that using contingent workers was a seamless process; workers were simply 
brought in when needed and let go when not. Employers benefited, in this 
picture, and permanent workers apparently were indifferent to, if not isolated 
from, the presence of new, tenuously positioned coworkers. The contingent 
work force was also portrayed in broad terms that didn’t capture the range and 
complexity of temporary and part-time positions. The prevailing view was that 
the labor market was characterized by a two-tiered system of employment, di- 
vided into core workers receiving good pay, benefits, opportunities for training 
(and in fact were much more likely to encounter the flexible production prac- 
tices discussed in the previous section), and some degree of job security, and 
peripheral, contingent workers who typically earned lower wages, received no 
benefits, worked in low skill jobs with few training opportunities, and faced sig- 
nificant job instability and insecurity (Callaghan & Hartmann 1991, Colclough 
& Tolbert 1992, Smith 1993). Core and peripheral workers also appeared to 
be hired for different jobs and in different locations. Recent studies have de- 
veloped a more fine-grained analysis of the consequences of this employment 
trend: bifurcation of skill and status, and resistance and conflict that develop 
on shop floors and in offices between permanent and contingent employees. 


Research Findings on the Contingent Work Force 


BIFURCATION AND VARIATION ‘Temporary positions traverse occupational and 
labor market boundaries (Mangum et al 1985). Most temporary workers are 
concentrated in the lowest levels of the occupational hierarchy, are typically 
hired for the least skilled jobs, and earn, on average, only a percentage of what 
their permanent wage and salary counterparts earn (Callaghan & Hartmann 
1991, Henson 1996, Thomas 1994). A smaller and growing number of individ- 
uals work in comparatively well-paid professional, technical, and managerial 
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positions on a temporary basis (Christensen 1989, Meiksins & Whalley 1995, 
Parker 1994, Rogers 1994). These are more likely to be independent contrac- 
tors than temporary help service employees, and thus they have higher, if not 
independent, status, earn higher wages, and work on an externalized (short- 
term but regular and predictable employment) and not necessarily contingent 
(short-term and irregularly and unpredictably employed) basis (Reskin 1996), 
Additionally, the temporary labor market cuts across peripheral and core firms 
(Cohen & Haberfeld 1993). 

Researchers taking a close look at hiring arrangements have found significant 
variations in employment practices and job tenure of temporaries. The organi- 
zational boundaries between permanent and temporary workers and within the 
temporary work force itself are imprecise and often obfuscated. Some employ- 
ees appear to be temporary workers from the viewpoint of one company's hiring 
practices—when their services are contracted by one firm on a delimited basis, 
when their wages are low and their work is routine—but 1n fact are permanent 
employees of another firm whose core business 1s to subcontract out the labor 
and service of their full-time and permanent workers (Smith 1994). Tempo- 
raries may be temporary in name only, often working for very long periods of 
time, receiving lower wages and no benefits, but performing the job and devel- 
oping the skills and organizational experience that a permanent worker might 
(Henson 1996, Milkman 1992). Temporary status may, on the other hand, be of 
short duration but lead to long-term employment, when employers use the ar- 
rangement as a recruitment channel, screening temporaries and watching them 
on the job to decide if they are qualified and trustworthy enough to be hired 
on a permanent basis (Cohen & Haberfeld 1993, Henson 1996, O'Reilly 1994, 
Smith 1996). 

Indeed, the boundaries between temporary and permanent workers can be 
ill-defined and multilayered, as both groups do the same tasks, hold the same 
positions (including informal supervisorial and coordinator positions), work 
the same hours, or wear the same uniforms. Companies can have multiple 
categories of temps, such as permanent and casual temporary workers (Project 
on Disney 1995). And the boundaries can be extended far afield, as firms, in 
their search for manufacturing flexibility, outsource production to small firms 
which, in turn, employ cadres of temporary workers. 


HIDDEN COSTS, CONFLICTS, AND ACCOMMODATIONS The presence of tempo- 
rary workers, posited to be cost-saving and flexible, can create hidden costs, 
rigidities, and tensions on the shop floor. A study of a white-collar service 
delivery site found that permanent workers collectively resisted management's 
effort to hire a batch of temps for an unexpected, large, and complex photocopy 
job (Smith 1994), on the grounds that temporary workers would not maintain 
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the appropriate standards for producing quality documents. The literature con- 
tains numerous examples of permanent workers’ animosity toward temporary 
workers when they work side by side, as they often do. Permanent assembly 
workers bore deep ill will toward temporary employees whom they considered 
to be unsuitable to carry out the job (Geary 1992); and permanent production 
workers felt that temps “just watched the clock” and ignored the quality of 
their work (Thomas 1994:123). Assembly workers in auto manufacturing felt 
that temporary assemblers performed “shoddy” work, yet their disapproval was 
complicated by their realization that their comparatively permanent, more priv- 
illeged status was made possible by the exploitation of the temporary workers 
(Graham 1995:131—35). l 

Management and workers may have to dedicate additional, unanticipated 
time to managing the work efforts of temps (Geary 1992, Gottfried 1991, Smith 
1994, Uzzi & Barsness 1995). In part, management policy toward temporaries 
often undercuts their own cost-cutting goals: Typically, temporary workers 
are not trained in the same fashion that permanent workers are (Geary 1992, 
Milkman 1992, Thomas 1994). Furthermore, temporaries typically are not 
invited to participate in company functions like picnics and holiday parties. In 
general, mixing permanent workers and temporaries yet not integrating temps 
into the culture of the shop or office floor can lead to tension and conflict, 
particularly if permanent workers view temporaries as a reserve army of labor, 
ready and willing to step into their full-time, permanent jobs (Geary 1992). 

Dynamics surrounding the use of temporaries look somewhat different when 
examined from their perspective: Temporaries may overcompensate for their 
tenuous status in the workplace by striving to become "super" employees who 
do high-quality work (Henson 1996), and they may engage in multiple, albeit 
individualized, everyday acts of resistance against the degraded conditions of 
their jobs: restricting output and walking off the job before their contract ex- 
pires under particularly stressful circumstances (Sottfried 1994, Parker 1994), 
and “looking busy” or "cruising" when they've been assigned very little work 
(Henson 1996:131, Rogers 1995). Temporaries also strive to control the con- 
ditions of their assignments by turning down jobs that they view as "emotion- 
intensive," in order to avoid being drawn too deeply into the politics and social 
relations of the firm (Rogers 1995). 


Destabilized and Intensified Employment for 

Managers and Professionals 

Instability and insecurity once thought to be confined to workers in the lowest 
levels of the labor market have spread through the ranks of American managerial 
and professional workers. Researchers recently have focused on a broad, desta- 
bilizing set of changes touching the careers of salaried employees, professional 
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and managerial alike. There is a growing body of literature on the “boundary- 
less” career, marked by continual mobility across and up through a number of 
work organizations (Baker & Aldrich 1966, Defillippi & Arthur 1994, Hirsch & 
Shanley 1996), although the evidence for the argument that people are chang- 
ing careers and occupations at a greater rate 1s tentative. Swinnerton & Wial’s 
(1995) data [which the authors note are “not conclusive evidence of a trend” 
(p. 303)] are cited widely: They found a slight but significant decrease in job 
stability for the years 1987—1991. 

Despite the absence of definitive evidence, there 1s a widespread perception 
in the literature that career stability is on the decline (Brown 1995). Managerial 
and professional displacement and unemployment, and white-collar unemploy- 
ment in general, has been unprecedented and the recreation of white-collar jobs 
slow (Mishel & Bernstein 1994). In-depth interview and observational studies 
of corporate restructuring and downsizing (Glassner 1994, Gould et al 1996, 
Heckscher 1995, Newman 1988, Smith 1990) and overviews of the erosion of 
internal labor markets and the subsequent transformation of managerial and 
professional employment (Brodsky 1994, Doeringer et al 1991, Hirsch 1993, 
Osterman 1988) have generated a picture of tremendous upheaval in the ranks 
of the middle-ranking employees of America’s corporations. Reports abound, 
in both scholarly and popular literature, of the widespread elimination of man- 
agerial ranks, of the difficulty of finding comparably paid and secure positions, 
and of the downward mobility of professionals and managers. The American 
Management Association survey (1995) on downsizing and displaced workers 
bolsters this picture. Of the level-identified jobs that had been cut between 
1988 and 1995, 18% were held by middle managers, who make up between 
5% and 8% of the American work force. Strategic or structural cuts dispropor- 
tionately affect management ranks (AMA 1995:4), suggesting that the middle 
ranks of large corporations are targets of cost-cutting strategies and that their 
work subsequently is being pushed into the lower levels of the firm. 

Adding to the impression that intensification of work is permeating and re- 
shaping managerial and professional employment, paralleling the intensifica- 
tion of nonmanagerial jobs, DiTomaso (1996) finds that even corporate workers 
employed on a permanent, secure basis are increasingly expected to act like sub- 
contracted workers. Employers expect managers and professionals to think of 
themselves as entrepreneurs or as self-employed, to continually ask how they 
can add more value to their work organization, and to self-monitor, to deeply 
internalize the viewpoint that they “are likely to be retained by the organization 
only as long as their expertise serves the needs of the organization” (DiTomaso 
1996:11). The strength of this normative system, in which “everyone is sub- 
contracted,” depends greatly on decentered organizational systems in which 
self-monitoring is essential (1996:10). Even those who dispute the argument 
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that middle management has been decimated acknowledge the sea-change in 
the norms and expectations for professional careers (Gordon 1996). 


CONCLUSION: INTENSIFIED, DECENTERED, 
AND DESTABILIZED WORK 


Social science research suggests that the transformation of production and the 
transformation of employment are reconfiguring the American workplace, in 
both practice and in rhetoric. Although specific outcomes of the flexible model 
are shaped by occupation and hierarchical position, its central features of em- 
ployment instability, decentered control, and work iatensification run across the 
occupational spectrum. I have suggested that the model can be characterized as 
combined and uneven, contradictory, and even flatly confusing. This ambigu- 
ity highlights the limitations to viewing the flexible model in polarized terms, 
as being either progressive, enabling, or a high-performance approach, or co- 
ercive, restrictive, or a low-performance approach (Appelbaum & Batt 1994, 
Gordon 1996). The new model combines elements of the two but in different 
configurations, playing different groups of workers off one another, eliciting 
participation from some by denying participative opportunities to others. Thus 
the two sets of changes discussed in this essay ar2 integrally bound up with 
one another, although they paradoxically continually undermine one another as 
well. 

A new participative basis for production has been institutionalized in many 
work settings, mostly production and manufacturing, mostly for a core group of 
employees. Individuals in the core are exposed to a context of opportunities: op- 
portunities to learn new skills in technology, data gathering and processing, and 
self-management. New production methods raise responsibility levels and in- 
crease decision-making opportunities in many cases, albeit within a constrained 
and decentered system of control. Thus, while genuine autonomy and control 
remain beyond the grasp of most workers, workers are encountering genuinely 
new ways of producing and serving. Importantly, their gains in organizational 
responsibility and involvement are incurred at the expense of corporate workers 
in supervisorial and management positions, with thenew culture of participation 
resting on the simultaneous downward transfer of managerial responsibilities 
and the elimination of layers of managers (Smith 1290:ch. 7). 

The studies reviewed here strongly suggest that core workers benefit from the 
new approach because they are buffered by a peripheral work force brought in to 
absorb fluctuations in work cycles but specifically excluded from participating 
in these organizational innovations. Contingent workers are not trained 1n new 
production methods, nor are they authorized to rarticipate in self-managed 
groups or quality circles. In contrast, by and large they are assigned to the 
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least skilled, most easily filled jobs, and they are monitored by their permanent 
counterparts. They are typically excluded from the company events and rituals 
that are organized socially to glue together the new participative agenda. 

It appears that the corporate agenda to secure workers’ involvement and 
quality effort is bolstered by the presence of a large and growing contingent 
work force that signals to permanent workers what can happen if one should 
refuse or limit one’s efforts. That significant numbers of people want full-time, 
permanent work but can’t find it, and that “the character of unemployment has 
decidedly shifted: a much larger proportion of the work force is either under- 
employed, overemployed, low paid, or trapped in unfavorable job situations” 
(Mishel & Bernstein 1994), are important ingredients in understanding per- 
manent workers’ consent to new participative arrangements. In other words, 
the control embedded in organizational mechanisms intersects with the control 
exerted through employment practices. Cognizant of their dispensability, and 
aware of scores of contingent workers seeking full-time and permanent jobs, 
permanent workers may be more disposed to learning how to work, and working 
intensively, within the demands of the participative and flexible model. 

An additional piece of evidence indirectly supports this. In a survey of 
companies that had downsized throughout the 1990s, 43% reported that they 
extended the working hours of remaining employees rather than hiring new 
ones (AMA Survey 1995:7). The question cf whether US workers are working 
longer hours has been much debated (Schor 1991, Robinson & Bostrum 1994), 
yet the AMA figures, as well as data of economists (Gordon 1996, Mishel & 
Bernstein 1994), reinforces the claim that American workers are doing more, 
prompted by a context of downsizing, cutbacks, declining wages, and employ- 
ment transformation. 

Aside from the uneasy mix of permanent and temporary workers, another 
tension underlies the two approaches to achieving greater flexibility. When 
companies downsize they can erode the commitments of remaining employees 
to principles of quality, participation, and adaptability. As Cameron (1995) 
notes, downsizing is antithetical to improving the quality of individual efforts’ 
and of the work organization; and in the United States employers continue 
to deploy both strategies, often without regard to the way they run at cross- 
purposes (Appelbaum & Batt 1994). 

Thus, unease and destabilization permeates work and employment in the 
late twentieth century. Involvement and empowerment are at the core of the 
new model of work, yet workers are compelled to engage in the organizational 
mechanisms by which they are achieved, by means of a multifaceted and decen- 
tered system of control. Decentered control systems incorporate and reproduce 
particular hierarchical features of traditional control systems, but their features 
have taken new forms. They insert mechanisms for monitoring and evaluating 
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into all the pores of the organizational hierarchy: in new technological systems 
that keep track of workers’ performances, in new ways of organizing the work 
flow that bring workers to the foreground of the attention of coworkers and 
supervisors, in work groups that heighten the degree to which individuals mon- 
itor themselves and each other, or in measures for incorporating the subjective 
interpretations of customers as a source of data for evaluating workers. Yet 
traditional control is coupled with a heightened job requirement to be more 
involved and responsible, often without added compensation or resources. 

The current emphasis on involvement and intensification in the context of 
destabilized employment suggests a great irony, wherein sociologists of work 
and workers themselves may begin to romanticiz earlier production arrange- 
ments in which secure nonmanagerial workers had the “privilege” of “leaving 
their brains at the door” of their hierarchical, centrally administered workplaces. 
While such nostalgia would simplify the historical record, the full and doubtless 
numerous implications of having to continually “cwn” and “manage” one’s job 
to the maximum extent possible for the good of the firm, need to be considered 
in greater depth. 

Finally, the changes documented in literature ən new forms of work orga- 
nization are most certainly redrawing the lines of gender and race hierarchy 
in the United States. Increased employment destabilization and contingent 
hiring practices likely will erode the stable, male career model characterizing 
large corporate bureaucracies of the postwar era (Acker 1990, Smith 1993). In 
tandem, these trends may increase some white men's "nonstandard" employ- 
ment experiences, leading their employment patterns to converge with those 
marginalized and uncertain patterns of many white women and women and 
men of diverse racial and ethnic groups (Reskin 1996). 

The terrain of work and employment in the era of flexibility is a relatively un- 
charted area of social science research. Recent studies have begun to illustrate 
the complexities and nuances of work organization, work relations, and hiring 
practices, but much remains to be understood. Four topics are of particular con- 
cern. First, more research is needed on service work settings and nonunionized 
workplaces. Service sector work far exceeds manufacturing work, on which 
the bulk of research on flexible models has been done. When workers produce 
services and even work on people, we would anticipate finding different organi- 
zational definitions of flexibility, as well as different outcomes related to team 
Work, decision-making, skill, and autonomy than -hose found in production set- 
tings. Second, this field would benefit from having data on mobility processes 
and outcomes as they are shaped by the contradictory forces of production and 
staffing flexibility. A third research goal shou.d be to analyze aspirations 
and identity at the crossroads of production and employment transformation, 
and corporate restructuring more broadly. We need a deeper understanding of 
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personal experience, subjective interests, and of how aspirations are sustained 
or crushed as the opportunity structure undergoes changes that appear to be 
permanent and radical. These will be essential in our understanding of consent 
to new production arrangements and of corporate employees’ participation in 
turmoil and instability. Finally, cross-cutting all these topics with a focus on 
the gendered and racialized dimensions of opportunity and outcome will 1nte- 
grally link the sociology of work and corporate transformation with the fields 
of stratification and inequality. 
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ABSTRACT 

This paper surveys sociological approaches to the study of markets. After consid- 
ering the economic approach, I delineate the wide range of theoretical schools, in- 
cluding alternative schools in economics, economic anthropology, cultural socio- 
logy, the embeddedness approach, and the new political economy. I also briefly 
discuss recent debates on the transition from planned economy to market and 
on globalization. I conclude by noting the difficulties of theorizing about the 
historical and institutional complexity of markets. 


Introduction 


We live in the age of the market. The category of the market dominates everyday 
discourse and political reality. Jobs, spouses, and commodities are all said to 
be obtained in their respective markets. After the collapse of communism, 
the market appears as the desirable and perhaps even the only viable form of 
exchange or coordination in a complex economy. 

The triumph of the market, in turn, elevates the standing of a science devoted 
to its explication. Neoclassical economics, in spite of persistent and powerful 
criticisms, emerges as the paradigmatic framework to analyze all spheres of 
social life, as evinced by the ascent of rational choice theory in political science 
and sociology. Nonetheless, the spread of the economic approach—orthodox 
microeconomic theory or necclassical economics—to noneconomic spheres is 
problematic. More important, broadly sociological approaches challenge the 
economic approach even in the study of markets. 

In this paper, I survey the major sociological approaches to the study of mar- 
kets. Several caveats are in order. Excellent recent reviews describe the exten- 
sive and expanding literature on markets in particular (Friedland & Robertson 
1990, Swedberg 1994) and economic sociology in general (Swedberg 1987, 
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1990, 1991; Smelser & Swedberg 1994). I am not vigilant in demarcating 
markets from economies tout court; the blurred boundaries are symptomatic of 
different conceptualizations of markets. Although I am not overly concerned 
about disciplinary distinctions, this paper has very little on labor markets or on 
the social psychology of market exchange. 


The Neoclassical Market and the Economic Approach 


The market is a central category of economics. Mark Blaug (1985:6), the 
indefatigable chronicler of economic thought, writes: ‘The history of economic 
thought. .. is nothing but the history of our efforts to understand the workings 
of an economy based on market transactions.” It is then curious that the market 
receives virtually no extended discussion in most works of economic theory or 
history. À promising title such as Theory of Markets (Allingham 1989) does 
not describe what constitutes a market or explain how it operates (cf Isachsen 
et al 1991). Similarly, three recent tomes on market and history offer but 
one sentence on the market itself (Anderson & Latham 1986, Galenson 1989, 
Haskell & Teichgraeber 1993). In point of fact, the absence or ambiguity of 
the market concept is as old as economics itself. JE Cairnes (1888:100), for 
example, criticized Adam Smith because "it is not quite clear... in what sense 
he [Smith] uses the word ‘market’... " The market, it turns out, is the hollow 
core at the heart of economics. 

What is the market in orthodox economics? Most economists follow Cournot 
(1897:51) in suggesting that price uniformity within a particular area signifies 
the existence of a market. The market clears—supplies are exhausted, while 
demands are satisfied at a given price. But what is the sociological description 
of the market? Milton Friedman (1962:14), for example, writes: “the central 
characteristic of the market technique of achieving co-ordination is fully dis- 
played in the simple exchange economy that contains neither enterprises nor 
money." This accords with the usual dictionary definition: “Generally, any 
context in which the sale and purchase of goods and services take place. There 
need be no physical entity corresponding to a market” (Pearce 1986:263). In 
other words, the neoclassical market is shorn of social relations, institutions, or 
technology and is devoid of elementary sociological concerns such as power, 
norms, and networks. | 

The very abstraction of the market—its ontological indeterminacy—allows 
for its universal applicability (cf Rosenberg 1992). The analytical structure 
developed for the abstract market, in other words, can be used for nonmar- 
ket spheres precisely because there is nothing particular about the institution 
or the structure of the abstract market. According to the economic approach, 
the phenomenological diversity of markets—from the medieval marketplace 
to the modern placeless market—1n fact reveals deep isomorphism, or market 
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essentialism. For explanatory purposes, diverse relations and institutions of 
economic exchange can be analyzed as the singular market. Rather than an- 
alyzing markets, the market is used to explain different instances of market 
exchange. 

The indisputable intellectual leader of the economic approach is Gary Becker. 
Ranging from racial discrimination to crime to marriage and the family, Becker 
has colonized literally all of the social sciences as grist for his analytical mill. 
The economic approach starts from the assumption that “all human behavior 
can be viewed as involving participants who maximize their utility from a 
stable set of preferences and accumulate an optimal amount of information 
and other inputs in a variety of markets” (Becker 1976:8). As he declares, 
the economic approach “is applicable to all human behavior’ and to all types 
of human beings (Becker 1976:8). Pierre Bourdieu, in this regard, resembles 
Becker in applying the market or economic metaphor to other realms of social 
life (Bourdieu 1977). In a different vein, exchange theorists such as Blau and 
Homans sought to apply the essential exchange framework to disparate spheres 
of social life (Ekeh 1974). However, just as market theories are not the same 
as theories of markets, exchange theorists did not always theorize exchange. 
Rather than explaining markets or exchange, they employ markets or exchange 
to explain social and economic life. 

The most concerted effort to apply microeconomic theory has occurred in 
the realm of politics. From the late 1950s, social scientists have used neoclas- 
sical economic analysis to make sense of governments and movements (e.g. 
Downs 1957). It is ironic that a presumed shortcoming of neoclassical eco- 
nomic analysis—the absence of power—turns out to be the sphere where it has 
had its most widely recognized success (e.g. Green & Shapiro 1994). In sociol- 
ogy, James Coleman (1990) has championed the use of the economic approach. 
Centered around the journal Rationality and Society, rational choice theory 
sustains a devoted, albeit still small, network of scholars. Its influence, owing 
in no small part to Coleman, is manifest in the diffusion of and the obeisance 
made to the choice-theoretic language in contemporary US sociology. 

The economic approach has also been criticized in the very sphere—the 
market and the economy—from which it originated. Consider the US securities 
market, which is the paradigmatic instance of an actually existing market. To the 
extent that neoclassical economists consider the sociological grounding of the 
market, they usually envision a modern stock market as in Walras's (1954:83— 
84) classic account. The social organization of Wall Street, however, reveals 
little resemblance to the neoclassical imagery. The prevalence of network 
ties and the premium on inside information make market operations far from 
the ideal of individual utility maximization with perfect information. Indeed, 
stock markets across space and time are rife with political interventions, social 
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networks, and power considerations. Hence, neoclassical economic analysis 
is problematic in making sense of either eighteenth-century London financiers 
or contemporary Wall Street moguls (Abolafia 1997, Baker 1984, Mirowski 
1988:210—32, Carruthers 1996). 

The social landscape of the economic approach is like a desert —far from the 
sociological concern with actual towns and cities. Externalities and institut- 
ions—the very areas of sociological research interests—are neglected (Hodgson 
1988, Block 1990, Papandreou 1994). Thus, large corporations and the inter- 
ventionist state are generally passed over (Chandler 1977, Sklar 1988, Fligstein 
1990, Block 1994). It is not surprising, then, that corporate power remains 
a virtually untouched topic in the economic approach (cf Schwartz 1988, 
Granovetter 1994). Although the economic approach would apply economic 
logic to politics, it seems more appropriate to apply political logic to economics 
(cf Piore & Sabel 1984). 

The economic approach is, moreover, antisociological. Consider Gary 
Becker's (1976:8) claim that the economic approach is applicable to all hu- 
man beings: "rich or poor persons, men or women, adults or children, brilliant 
or stupid persons, patients or therapists, businessmen or politicians, teachers or 
students." In advancing this universalist claim, the economic approach elides 
social differences. Inequality—the backbone of contemporary sociology— 
does not play a significant role in the economic approach (Lie 1992). In other 
words, class, race and ethnicity, gender, or any other social attribute remains es- 
sentially outside the theoretical purview of the economic approach (e.g. Ferber 
& Nelson 1993, Folbre 1994, Nelson 1996). And what assumption would be 
more inimical for contemporary US sociology than the assertion of the abstract 
individual that renders social inequality and difference irrelevant? 

In criticizing the economic approach, I do not mean to dismiss it altogether. 
The economic approach is, after all, not all of one piece (for latest developments, 
see Journal of Economic Literature and Journal of Economic Perspectives). 
Furthermore, the dialogic character of academic discussion ensures that once 
a criticism is raised against a particular view, it evolves to meet the criticism 
(and to deny that it had subscribed to one naive assumption or another). 


Some Alternatives in Economics 


Many economists have opposed historically orthodox positions and challenged 
dominant paradigms (Heilbroner & Milberg 1995). Thorstein Veblen, Karl 
Polanyi, Piero Sraffa, Joan Robinson, Paul Sweezy, Gunnar Myrdal, and John 
Kenneth Galbraith are but some of the illustrious dissenters. Even in contem- 
porary US economics, dominated as it is by the marginalist orthodoxy, there are 
evolutionary, institutionalist, Neo-Marxist, Post-Keynesian, Neo-Austrian, and 
other schools of thought (e.g. Bowles & Edwards 1990, Hodgson et al 1994). 
Journals, such as Cambridge Journal of Economics, Economy and Society, 
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Journal of Economic Issues, and Review of Radical Political Economics, fea- 
ture information and insights unavailable in mainstream publications. Be that 
as it may, very few have made sustained efforts to analyze the market or to probe 
its theoretical foundations (cf Robinson 1980:146-167, Vickers 1995). Iron- 
ically, even some of the most vociferous dissenters often accept the orthodox 
position on the market. Let me, however, briefly survey some of the revisionist 
movements within the American economics profession. 

Numerous theoretical leads have yet to be fully developed. It is unfortunate, 
for example, that so few economists have built on the early theories of monopoly 
competition, such as Chamberlin (1933) and Robinson (1933) (cf Blanchard 
& Kiyotaki 1987). A variety of post- Walrasian economics seeks to supersede 
the restrictive assumptions of orthodox microeconomic theory (Bowles et al 
1993, Colander 1996). The innovative works of George Akerlof (1984) also 
articulate the consequences of considering assumptions different from those 
of orthodox theorizing. Some economists take conflict seriously (Garfinkel & 
Skaperdas 1996), while Telser (1987) argues that excessive competition under- 
mines economic efficiency. John Hicks (1989) argues for the interdependence 
of methods of exchange, media of exchange, and market structures. 

Industrial organization—a longstanding subfield of economics—is notable 
for its institutionalist orientation. Its primary task is to analyze the relationship 
between market structure and performance (Bain 1968, Scherer 1980, 1992). 
In analyzing barriers to entry, for example, students of industrial organiza- 
tion focus on issues ranging from advertising and product differentiation to 
monopoly power (Bain 1956, Schmalensee & Willig 1988). The leading schol- 
ars, however, analyze markets or industries that do not fitthe condition of perfect 
competition, and hence they leave the orthodox economic theory intact. These 
exceptions are, however, probably more of a rule. In a relative vein, anti-trust 
economics analyzes institutions and the legal system (e.g. Williamson 1987). 
Another noteworthy development is the theory of contestable markets, which 
is, however, less concerned with institutions (Baumol et al 1982). 

New institutional economics, inspired by Coase's theory of the firm, has also 
sought to overcome the limitations of orthodox economics (Williamson 1975, 
Coase 1988). In considering transaction costs, they incorporate the assump- 
tion of bounded rationality and regard the firm as a governance structure (cf 
Granovetter 1985). New institutional economics has inspired innovative works 
in economic history (e.g. North & Thomas 1973). In a relative vein, there are 
evolutionary economists who consider history and institutions in their analyses 
(Nelson & Winter 1982, Hodgson 1993). 

Institutional economists also remain active (Hodgson 1988, Rutherford 1994). 
Inspired by the fundamentally sociological insights of John Commons and 
Thorstein Veblen, they analyze markets as patterned behavior following insti- 
tutionalized rules. Their starting point is ownership; its transaction constitutes 
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the ultimate unit of analysis. The old-style institutionalists criticize new insti- 
tutional economics for failing to take seriously concrete institutions, historical 
variations, and power relations. In particular, radical institutionalists stress the 
role of corporate power in the US economy (Eichner 1976, 1991, Peterson 
1988, Dugger 1989). Their marginality in the American economics profession 
stems from their putative focus on description (and lack of formalization) and 
their normative thrust (as opposed to the presumed neutrality of neoclassical 
analysis). The neglected works of John Kenneth Galbraith (1956) and Robert 
Brady (1943) on business power also deserve reconsideration. 

Marxist economists have generally accepted the neoclassical conceptualiza- 
tion of the market, albeit usually dismissing it on political grounds (cf Moore 
1993). Most notable theoretical developments—whether the revival of classi- 
cal political economy via Piero Sraffa or the rise of analytical Marxism and the 
fusion of Marx and Walras—have also moved Marxists away from considering 
empirical markets (Sraffa 1960, Roemer 1981). A notable exception 1s Samuel 
Bowles and Herbert Gintis's (1990) effort to theorize contested exchange. In 
addition, William Lazonick (1991) challenges a number of cherished neoclas- 
sical economic assumptions, including the efficiency of competition. 

Intellectual history has also provided powerful critiques of neoclassical eco- 
nomics (Tribe 1981, Hont & Ignatieff 1983; cf Winch 1996). Philip Mirowski’s 
More Heat Than Light (1989) demonstrates the transposition of the mathemat- 
ical formulae and thinking of classical mechanics and physics to marginalist 
economics, thereby questioning the foundations of neoclassical economics. 
Intellectual historians, however, do not propose an alternative framework (cf 
Fonseca 1991). 

Finally, the popular writings of Greider (1992), Kuttner (1987, 1997), 
Heilbroner (1988), Silk & Silk (1996), and others have useful insights for 
understanding markets (see also Thurow 1983, Hirschman 1986). They should 
certainly not be dismissed as mere journalism. 


The Moral Economy 


A persistent and powerful response to the market—-whether the theory or the 
reality—is to reject it on moral or political grounds. An exemplary figure in 
this regard 1s John Ruskin, whose passionate and prophetic writings influenced 
generations of radical thinkers (Sherburne 1972). His criticism of greed and its 
negative consequences—such as the unraveling of the social fabric or the de- 
struction of nature—have redounded throughout the twentieth century. Indeed, 
the development of markets is in many ways coeval with its critics (Thompson 
1988). 

The moral critique of the market has a long pedigree. Aristotle's economic 
thought is, for example, inextricable from his moral philosophy (Meikle 1995). 
The entwinement of markets and morals is not just a matter of intellectual 
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history. The operations of the open market in seventeenth-century England 
cannot be told apart from extensive political regulations and the moral economy 
of the time (Thompson 1991, Lie 1993). Indeed, there are no self-sustaining 
markets without some form of rules and regulations—the Hobbesian problem 
of order remains a perennial predicament (cf Parry & Bloch 1989, Lie 1992). 

Social economics and socioeconomics both take off from deep engagement 
with moral questions. Social economics, centered around the journal Review 
of Social Economy, is a variant of institutional economics. It seeks an interdis- 
ciplinary understanding of socioeconomic behavior with an explicit intent to 
enhance social welfare (Lutz 1990, O’Boyle 1996). Social economists there- 
fore criticize the neoclassical market not only on scientific but on moral grounds 
as well. Similarly, Amitai Etzioni has called for socioeconomics, which is 1n- 
spired in part by Talcott Parsons’s insight that the economy should be analyzed 
as a subsystem of the larger social system, as well as by Kantian moral philos- 
ophy. He stresses the role of duties, trust, cooperation, and other integrative 
principles and mechanisms not only as descriptive tools but also as prescriptive 
principles (Etzioni 1988, Etzioni & Lawrence 1991). Etzioni’s socioeconomics 
is thus far more notable for its humanistic impulse—as his advocacy of commu- 
nitarianism suggests—than for its scientific accomplishments (cf Cantor et al 
1992). 

In addition, numerous writings combine scientific and moral impulses in the 
study of markets. This is explicit, for example, in the advocate of the market 
and laissez-faire economic and social policy (Waligorski 1990). Radical critics, 
in turn, denounce the market and its deleterious consequences. The market is 
blamed for social inequality, environmental destruction, and communal disinte- 
gration (Herman 1995, Baum 1996, Glasman 1996). In this vein, Robert Lane 
(1991) makes the most sustained case for changing the evaluation of market 
performance from purely economic to mcre political and moral standards (cf 
Scitovsky 1992). 

Moral criticisms are an indispensable part of public discourse and play a sig- 
nificant role in shaping social-scientific theories of markets. The intersection of 
economic and moral theory has become an especially fertile field of intellectual 
inquiry (Farina et al 1996, Hausman & McPherson 1996). Nonetheless, moral 
critiques and concerns are inadequate in and of themselves to advance the effort 
to clarify our understanding and explanation of markets. 


Economic Anthropology and Culture 


In surveying economic anthropology in the 1990s, the formalist-substantivist 
debate no longer offers a compelling starting point (cf Halperin 1994, Wilk 
1996). The putatively sharp division between market and nonmarket (e.g. gift) 
exchange obscures the diversity of markets, just as the insistent distinction be- 
tween the West and the rest exaggerates the difference between us and them 
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(Thomas 1992). The blurring of the boundaries between the industrial and 
nonindustrial economies leads some to anthropological analysis of the West 
(Carrier 1995). At the same time, others stress the motive of gain and the im- 
portance of market exchange in nonindustrial societies (Davis 1992, cf Gregory 
1982:8—9). 

Several lines of inquiry in economic anthropology are of interest. G. William 
Skinner's classical work (1964—1965) on the rural Chinese trading networks 
takes seriously the impulse to generalize and to consider geography and history. 
Developed as regional analysis, several formal models exist to make sense of 
local and regional trading networks (Smith 1976), although their applicability 
to different cultures and time periods remains contested (Wanmali 1981). Polly 
Hill's (1972) analysis of the Hausa economy (Nigeria) demonstrates sophisti- 
cated understanding of macrosociological and historical forces impinging on 
local markets (see also Hart 1982). Macfarlane (1987), Halperin (1988), and 
Gudeman (1986, Gudeman & Rivera 1990) stress the importance of culture in 
shaping economic categories and institutions. There are interesting works on 
gendering gift exchange (Strathern 1988) and on women traders (Clark 1994). 
Roy Dilley's edited collection (1992) deserves special mention (see also Plattner 
1985, 1989). 

Bthnographic observations yield descriptive richness missing in the eco- 
nomic approach (Gregory & Altman 1989), and ethnographies are veritable 
treasure troves of insights on markets (Firth 1939, Bohannan & Bohannan 
1968, Strathern 1971, Beatty 1992). Paul Bohannan's (1955) discussion of 
distinct spheres of exchange and Mary Douglas's (1967) analysis of controlled 
exchange are redolent with implications for analyzing contemporary markets. 
There are also excellent studies of the relationship between inequality and mar- 
kets (Hill 1982, Harriss 1984a) and that between states and markets (Bates 
1981, Harriss 1984b). Nonetheless, in the age of globalization, fieldwork by 
itself may very well miss translocal and historical factors (cf Roseberry 1988, 
Thomas 1991, Carson & Harris 1995). 

In addition to anthropologists, historians and sociologists have also investi- 
gated the interaction of culture and markets. Two contributions at the intersec- 
tion of economic and cultural history are noteworthy. Jean-Christophe Agnew’s 
(1986) Worlds Apart grounds its cultural history of England in the shift from an 
economy dominated by marketplaces to one dominated by placeless markets. 
The economic change underpins his analysis of the changing cultural tropes 
of early modern England. William Reddy's exploration of modern French so- 
ciety stresses the "mirage" of market society. According to Reddy, France 
became not so much a market society as a market culture— "when the language 
of the mirage insinuated its assumptions into the everyday practice" (Reddy 
1984:1—2); the dominance of the market category contributed to the hegemony 
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of capitalist order. Reddy argues that reformist and revolutionary efforts failed 
in part because they did not articulate a critique of market categories (see also 
Reddy 1987). 

Sociologists have also explored the interrelationship between culture and 
markets (Chang 1991, DiMaggio 1994). Viviana Zelizer’s books (1983, 1987) 
exemplify a cultural approach to the study of markets. Zelizer (1988:618) calls 
for a "multiple markets" model: “the market as the interaction of cultural, 
structural, and economic factors." In so doing, she seeks to avoid both the over- 
socialized and undersocialized conceptions of economic action and institutions 
(see also Blau 1993). 

Inspired in part by the cultural turn in the social sciences, consumption has 
become a popular terrain of inquiry. Here there are several distinct threads. 
Economic, social, and cultural historians analyze the birth of modern consumer 
practice and society (Thirsk 1978, McKendrick et al 1983, Shammas 1990, 
Lebergott 1993, 1996). Several anthropological studies link the study of con- 
sumption with that of markets (Colson & Scudder 1988, Miller 1995). There 
are, furthermore, Marxist and Weberian theoretical frameworks for analyzing 
consumption (Campbell 1987, Fine & Leopold 1993, Fine et al 1996). In addi- 
tion, a large literature on marketing research is at once diverse and informative 
(Brown 1995, Thomas 1995). 


Embeddedness and Social Networks 


The revival of economic sociology in North America has catapulted Mark 
Granovetter's 1985 article into prominence as its programmatic text and the 
embeddedness approach as its primary framework (Swedberg 1991). Con- 
sciously departing from the old economic sociology of Talcott Parsons and his 
colleagues, the proximate theoretical inspiration of the embeddedness approach 
js Karl Polanyi's work, especially his collaborative book Trade and Market in 
the Early Empires: “The human economy... is embedded and enmeshed in 
institutions, economic and noneconomic" (Polanyi et al 1957:250). 

The bedrock assumption of the embeddedness approach is that social net- 
works—built on kinship or friendship, trust or goodwill—sustain economic 
relations and institutions. The basic idea is as old as sociology itself, such as in 
Durkheim's critique of Spencer—the existence and necessity of pre-contractual 
elements in contracts. To be sure, the insight is not unique to sociology. In 
Industry and Trade—one of the few texts where market is discussed by a 
prominent economist—Alfred Marshall (1920:182) wrote: "Everyone buys, 
and nearly everyone sells... in a ‘general’ market... . But nearly everyone 
has also some ‘particular’ markets; that is, some people or groups of people 
with whom he is in somewhat close touch: mutual knowledge and trust lead 
him to approach them... in preference to strangers.” Although Marshall was 
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responsible for widening the gulf between the study of economy and of politics, 
his work contains acute sociological insights (cf Collins et al 1983: Ch. 10). 
Goodwill, trust, and special tastes are but three of the many factors that sustain 
network ties that circumvent the simple calculus of utility maximization. 

Most contemporary American economic sociologists work under the ban- 
ner of the embeddedness approach. In particular, network analysis is central 
to the embeddedness approach (Powell & Smith-Doerr 1994, cf Axelsson & 
Easton 1992). The central motif of Granovetter’s oeuvre is that "social relations 
are fundamental to ‘market processes"' (Granovetter 1985:500). Granovetter 
(1994) himself expands on his programmatic article by considering a number 
of spheres ranging from the labor market to business groups. 

Along with Granovetter, two sociologists exemplify the embeddedness ap- 
proach. Harrison White’s (1981) analysis of production markets as role struc- 
tures was the first substantive work of the new economic sociology. He extends 
his analysis by stressing struggles for control and autonomy that generate the 
market as a social category (White 1992, 1993). Ronald Burt (1983, 1992) 
pursues a broadly network approach in his studies of contemporary US markets 
and competition. His conceptualization of structural hole—the agglomeration 
of ties with fewer relations among themselves—is not only a provocative idea 
but a fully operationalized empirical project. 

Several scholars extend the work of Burt, Granovetter, and White. In addition 
to his study on corporate behavior (Baker 1990), Wayne Baker, in conjunction 
with Robert Faulker, incorporates structural, interactionist, and network consid- 
erations into White’s analysis of markets as role structures (Baker & Faulkner 
1991). Joel Podolny (1993) argues for the salience of status order and posi- 
tions, rather than roles, in making sense of production markets (cf Han 1993). 
Brian Uzzi (1996) combines various threads in order to better operationalize 
the embeddedness approach. In another vein, Neil Fligstein (1996) advocates 
a political-cultural approach that combines the insights of the embeddedness 
approach with that of the cultural frame approach. He employs his frame- 
work to analyze the problem of the European Union (Fligstein & Mara-Drita 
1996). 

The embeddedness approach is salutary in stressing social relations and net- 
works. In avoiding both the oversocialized (e.g. the substantivist school in 
economic anthropology) and undersocialized (e.g. the economic approach) ap- 
proaches, it seeks to strike a correct balance in analyzing markets and other 
economic phenomena and institutions. In so doing, it registers the existence 
of disparate market relations and institutions. Harrison White (1992, 1993), in 
particular, emphasizes power and control struggles as a crucial constituent of 
markets. The embeddedness approach, 1n other words, avoids market essen- 
tialism and incorporates power. 
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Nonetheless, there are several grounds of criticism. The structuralist tenor 
of the embeddedness approach generally neglects nonsocial or nonstructural 
factors, such as culture, technology, and even macroeconomic forces. In spite 
of avoiding market essentialism, the embeddedness approach in practice largely 
eschews analyzing historical and cultural variations in markets. Power, espe- 
cially in the noneconomic realm, remains elided—there is nary a mention of 
the state in many accounts. In addition, Marx’s crucial insight—the structured 
inequality underlying the formal equality of the market—is well worth recover- 
ing. Social networks exist inevitably within the larger historical and structural 
context. The embeddedness approach must itself be embedded in larger, histor- 
ically transient, social structures—not only state institutions and suprastate or- 
ganizations, but also historically shifting transnational relations and structures. 


Politics and Markets 


The classic question of power and wealth and the concern with big structures 
and large processes continue to be fundamental. But the economic approach, 
as I noted, disentangles markets from power considerations. As OH Taylor 
(1955:116) observed in 1934, “the real economist’s utopia is now and for- 
ever laissez faire... of a scheme of policy eliminating all interference of all 
‘politics’... " The disembedded economy—an idea traced to seventeenth- 
century England (Appleby 1978)—accompanies the belief in the separation 
of politics from markets as a desirable condition. Power is, however, a cru- 
cial fact of economic and social life, whether one seeks to understand cor- 
porate behavior or the interventionist state. Even in an ostensibly abstruse 
economic phenomenon, such as inflation, the politics of social classes is im- 
portant (Goldthorpe 1978). Or, consider the fact of corporate downsizing in the 
1990s. Although many economists would attribute massive lay-offs to market 
forces, David Gordon (1996) argues for the crucial importance of corporate 
decision-making. In other words, corporate or managerial power is important, 
and its neglect obscures the crucial forces at work in the economy. 

The locus classicus of the new political economy approach that brings power 
and markets together is Charles Lindblom’s Politics and Markets (1977) (see 
also Shonfield 1965). The relationship between states and markets has gener- 
ated a large scholarly literature (Hall 1986, Freeman 1989, Przeworski 1991, 
Hollingsworth et al 1993, Schwartz 1988). Particularly noteworthy are com- 
parative studies that explore the different parameters of state intervention and 
market structure (Sabel & Zeitlin 1985, Dobbin 1994, Herrigel 1996, Schmidt 
1996). There are also interesting works on the role of regulation in shaping 
market structure and competition (Vietor 1994, Mercer 1995). 

The new political economy approach demonstrates the extensive state in- 
tervention in the modern economy. Furthermore, political institutions and 
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processes are viewed as constitutive of economic institutions and processes. 
This is especially true for East Asian economies such as Japan (Sheridan 1993, 
Vestal 1993, Uriu 1996). Similarly, studies of South Korean and Taiwanese de- 
velopment repeatedly find the preponderant role of the state in markets (Wade 
1990, Fields 1995). Simon Reich (1990) argues that the post-World War II 
successes of Japan and West Germany are due in part to the institutional legacy 
of prewar state intervention. The debate over the decline of the British economy 
is an interesting case study (Alford 1993). Although some argue that state in- 
tervention in markets led to sluggish economic performance (Middleton 1996), 
others point out the deleterious consequences of Thatcher’s market revolution 
(Gamble 1994). 

Beyond the nation-state, there are studies on governance structures in the 
global economy (Michie & Smith 1995, Cox 1996). International trade and 
global markets remain areas of political intervention (Hirschman 1980, Milner 
1988, Salvatore 1993). The interpenetration of interstate relations, state eco- 
nomic policies, and markets is a promising area of inquiry (Wurm 1993, 
Keohane & Milner 1996). 


From Plan to Market 


In the early 1990s, perhaps the most salient market-related issue was the tran- 
sition from planned economy to market, especially in Eastern Europe. Indeed, 
as World Development Report 1996 noted in its subtitle, “From plan to market,” 
privatization or market transition is a problematic for virtually all of the world 
(World Bank 1996). 

The end of communism created an avalanche of proposals to privatize the 
formerly communist economies and to effect a transition from planned econ- 
omy to market. There are several overarching overviews (Ramanadham 1993, 
Jeffries 1996; see also Centeno 1994). In general, economists advocate priva- 
tization and marketization (Blanchard et al 1991, Sachs 1993). To the extent 
that there are problems with privatization, they stem largely from the prevail- 
ing political power groups (Boycko et al 1995). What neoclassical economists 
prescribe is a wholesale transformation from plan to market. Empirical inves- 
tigations, however, reveal the complexity of institutional legacies that differ 
across Eastern Europe. Just as pretransition economies were not simply cen- 
tralized (Prout 1985, Szelényi 1989, Berend 1990, Aslund 1992; cf Kornai 
1992), the posttransition strategies and outcomes are not all of one piece (Stark 
1990, 1992, Burawoy & Lukács 1992, Bryant & Mokrzycki 1994, Kovács 
1994). The same can be said about the market transition 1n China (Nee 1989, 
1992, Shirk 1993, Wank 1993). In particular, comparative studies illuminate 
the diversity of the seemingly singular transition (Nee & Stark 1989, Walder 
1995). 
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Globalization has also generated a great deal of hype and debate, leading 
some to proclaim the imminent arrival of a global market without national 
borders (cf Hirst & Thompson 1996). As with the transition in Eastern Europe, 
discussions of globalization are impoverished by the assumption of market 
essentialism. Historical or comparative variations are elided in favor of the 
quintessential market. In the economic approach, the expansion of the world 
market occurs across frontiers without prior structures of exchange relations, 
but such an image ignores the extensive commodity exchange networks in 
noncapitalist economies (Hill 1986). 

Following Karl Polanyi's The Great Transformation (1957), scholars have 
sought to depict both globalization (market expansion) and its deleterious con- 
sequences (social responses) (Overbeek 1993, Stallings 1995, Walton & Seddon 
1994, Mittelman 1996). The impact of globalization is especially interesting 
in Latin America, where many accounts depict far more negative portraits of 
privatization and market expansion than do their counterparts in Eastern Europe 
(Taylor 1990, Smith et al 1994, Collins & Lear 1995, cf Ryan 1995). 


Conclusion 


Ás the empirical literature on market transitions suggests, the neoclassical mar- 
ket concept elides different types of market exchange. The most important 
and compelling evidence of market diversity is the work of Fernand Braudel 
(1982), especially the second volume of his magnum opus Civilization and 
Capitalism. Braudel synthesizes a staggering amount of information to present 
the sheer diversity of trade and markets across history and cultures. In spite 
of the rich array of materials he gathers, Braudel eschews formalistic or ab- 
stract presentations (see also Polanyi et al 1957, Hodges 1988). In the absence 
of a compelling theoretical alternative, however, we are left with descriptive 
diversity but theoretical monism. 

The assumption of market essentialism forecloses considerations of alter- 
native forms of exchange relations and structures. Given the historical and 
comparative diversity of market relations and institutions, there is at least a 
prima facie reason to consider alternative arrangements. If the only alterna- 
tives were plan and market, then market socialism, for example, would be an 
oxymoron. This claim has been made by both neoclassical economists and 
Marxists (cf Putterman 1990, McNally 1993, Arnold 1994, Stiglitz 1994). In 
fact, however, there are interesting studies on market socialism based both on 
empirical studies and on theoretical speculations (Brus 1975, Nove 1983, Le 
Grand & Estrin 1989, Miller 1989, Bardhan & Roemer 1993, Roemer 1994). 
In addition, alternative conceptualizations of exchange mechanisms include co- 
operatives and participatory economies (Clayre 1980, Elster & Moene 1989, 
Albert & Hahnel 1991, Ellerman 1992, Archer 1995). 


354 LIE 

In closing, I should stress that empirical eclecticism provides no panacea. 
Theoretical questions about the significance and role of power in social life or of 
macrosociological foundations provide contradictory answers. The economic 
approach is, in this view, incompatible with an elementary understanding of 
sociology (as much as there are sociologists who would wish to remedy this 
situation). In order to advance theoretical works on markets, the assumption of 
market essentialism should be jettisoned in favor of describing and analyzing 
the empirical diversity of actually existing markets. In addition, power and 
macrosociological foundations need to be better theorized. One possibility is 
to seek an integration of the embeddedness approach with the new political 
economy (“politics and markets”) approach. 

Many sociologists would regard sociology of markets or even economic so- 
ciology as outside the domain of sociology proper. I would argue that it is, in 
fact, quite crucial for sociologists to study markets. Consider only its valence 
in policy discussions. James Tobin (1980:46) notes: "The view that the market 
system possesses... strong self-adjusting mechanisms that assure the stabil- 
ity of its full employment equilibrium is supported neither by theory nor by 
capitalism’s long history of economic fluctuations.” But free market policies 
remain popular, whether in employment policy or macroeconomic policy. As 
we consider the intellectual and political significance of understanding markets, 
Wicksteed's (1950:784) conclusion remains relevant today: “the better we un- 
derstand the true function of the ‘market,’ in its widest sense, the more fully 
shall we realise that it never has been left to itself, and the more deeply shall 
we feel that it never must be. Economics must be the handmaid of sociology.” 
The study of markets is too important to be left to economists. 
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ABSTRACT 

In the past few years, the area of politics and culture has moved from the margins 
of cultural inquiry to its center as evidenced by the number of persons who iden- 
tify themselves as working within the area and by its growing institutionalization 
within sociology. “Politics and culture” suggests that each term constitutes an 
autonomous social realm; whereas "political culture" suggests the boundaries 
of cultural action within which ordinary politics occurs. Bourdieu's emphasis 
on boundary making, Foucault's disciplinary mechanisms, and Habermas's con- 
ception of the public are setting the research agenda of scholars who focus on 
macro-level social change. Interdisciplinary dialogues are emerging, conducted 
on a landscape of historical and contemporary empirical research. Four sub-areas 
have crystallized: first, political culture, which focuses on problems of democra- 
tization and civil society; second, institutions, which includes law, religion, the 
state, and citizenship; third, political communication and meaning; and fourth, 
cultural approaches to collective action. Promising directions for future work 
are historical ethnographies, participant observation and interview studies of po- 
litical communication, and studies of political mobilization that examine how 
emotion operates in politics. Paradigms are not yet firm within this area, suggest- 
ing that politics and culture is a disciplinary site of theoretical, methodological, 
and empirical innovation. 





MAPPING THE FIELD 


Three years ago I wrote a preliminary review of current research on politics 
and culture (Berezin 1994a). I oriented my first review toward methodology 
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and asked how sociologists might pose relevant questions that wed culture to 
political analysis. My premise was that academic discourse was beginning 
to link the words “politics” and “culture” with more frequency than analytic 
precision. I argued that sociologists should guard against the seductions of the 
linguistic turn and be wary of the idea (influenced more by literary than social 
theory) that representations of power equal realities of power (Freedberg 1989, 
Friedlander 1992). 

Sociology’s strongest potential contribution to this emerging and hybrid field 
lay 1n empirically grounded studies that built on the traditional areas of political 
analysis such as states, social movements, and even voting behavior. In 1993, 
much of the empirical work was located in history (Hunt 1989), anthropology 
(Cohen 1974, Kertzer 1988, Ortner 1984), and to a lesser extent in political 
science (Laitin 1986). While sociologists studied problems that fell under the 
rubric of politics and culture (for example, Goldfarb 1991, Bellah et al 1985, 
1991, Hunter 1991), very few labeled their work as such.! Much sociolog- 
ical discussion of politics and culture consisted of theoretical debates about 
postmodernism (Alexander 1992, Giddens 1990), history (Calhoun 1992, Hall 
1990), and power (Lamont 1989, Mitchell 1990). 

In the three short years that have elapsed since the writing of that first review, 
a sea change has occurred in the perception of politics and culture within so- 
ciology, and steps have been taken toward institutionalizing it as a disciplinary 
subfield. First, the number of persons who identify their work as falling within 
the field has increased. For example, the American Sociological Association's 
(ASA) section on Sociology of Culture now contains a working network de- 
voted exclusively to issues of Politics and Culture. À brief perusal of the ASA 
annual meeting program for the last two years reveals whole panels devoted 
to politics and culture, national identity and political symbolism. Second, a 
number of major university presses now have book series devoted to issues 
of culture and politics.? Third, dialogues are emerging within sociology con- 
ducted on a landscape of historical and contemporary empirical research. For 
this review, I have identified four sub-areas that have crystallized within the last 
three years: first, political culture, which focuses on problems of democrati- 
zation and civil society; second, institutions, which includes law, religion, the 


l'The broad sweep of Hunter's (1991) and Goldfarb’s (1991) analyses of contemporary American 
cultural dilemmas follows in the tradition of Bell's (1976) The Cultural Contradictions of 
Capitalism 

2The most prominent among these are: Craig Calhoun’s “The Contradictions of Modernity” at 
Minnesota, Jeffrey C. Alexander’s and Steven Seidman’s "Cambridge Cultural Social Studies" at 
Cambridge; David Laitin’s and George Steinmetz's "Wilder House Series in Culture, Politics and 
History" at Cornell; and Lynn Hunt's and Victoria Bonnell's "Studies m the History of Society and 
Culture" at Berkeley. 
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state, and citizenship; third, political communication and meaning; and fourth, 
collective action. 


POLITICAL CULTURE VERSUS POLITICS 
AND CULTURE 


The terms “political culture” and “politics and culture” are not reducible to each 
other. Failure to draw a distinction between them leads to analytic imprecision 
and linguistic confusion. As this distinction is implicit in much of the literature 
I review, it is important to flag at the outset. I distinguish between the two 
terms as follows. “Politics and culture” suggests that each term constitutes an 
autonomous social realm. It suggests that there are broad cultural themes that 
are sometimes mobilized in the service of politics and sometimes not. These 
themes would exist independently of their political uses. The task of the social 
analyst is to understand how the two intersect. 

Sewell’s (1992) discussion of the play of schema and resources in social 
transformation as well as Swidler's (1986) analysis of culture as “tool kit” are 
useful analytic frames for specifying the distinction and testing the interaction 
between politics and culture. Sewell argues that schemas are deep rules of 
behavior that can be used as resources by those who know and wish to change 
the rules of the cultural game. Language is the vehicle that enables agents to 
act upon structures, as well as the cultural form that activates transformation. 
Agents may dip into Swidler’s cultural "tool-kit" when the available repertoire 
of social and political practices requires reinforcement, when it changes or col- 
lapses completely. Sewell and Swidler ascribe a relative autonomy to culture. 
Cultural rules, symbols, and practices may serve to create a new political cul- 
ture, or they may simply contribute to the political culture that is already in 
place. 

Luker's (1984) study of abortion activists and Merelman's (1991) compara- 
tive study of political culture in Canada, Britain, and the United States convey 
the spirit of my distinction even if they do not rely on either Swidler's or Sewell's 
definitions. Luker discovered that the cultural meanings that women attached 
to the role of motherhood made possible prediction not only of their stance on 
abortion but also their level of activism. Feelings and meanings about moth- 
erhood are latent and cultural and exist independently of political exigencies; 
however, certain types of political issues, such as abortion, have the capacity to 


?While I have tried to be as inclusive as possible in citations, space constraints have forced me 
to focus on trends, published as opposed to unpublished work, and frequently books rather than 
articles. Many of the citations could be in multiple categories but for the purposes of simplicity 
I have grouped them in the area where they have had the most impact Needless to say, the four 
sub-areas which I have identified merit their own Annual Review articles. 
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force those meanings to the surface where they may become the foci of political 
mobilization. Merelman studied cultural artifacts such as television sitcoms, 
magazine advertisements, school textbooks, and corporate publications to de- 
termine what types of popular narrative shaped democratic ideas and practices. 
He discovered that representations of the family revealed national differences 
in perceptions of collectivity and individualism that link these underlying cul- 
tural ideals to the types of democracy present in Britain, the United States, and 
Canada. In Merelman’s study, culture shaped political practice. 

In contrast to “politics and culture,” “political culture” I view as the matrix 
of meanings embodied in expressive symbols, practices, and beliefs that con- 
stitute ordinary politics in a bounded collectivity. The debate between Laitin 
and Wildavsky (1988), Inglehart’s call for a “renaissance” in political culture 
(1988), and Eckstein’s discussion of political development and culture (1988), 
as well as numerous review articles (Chilton 1988, Davis 1989, Brint 1994a), 
suggest that political scientists rediscovered political culture in the late 1980s. 
In the "old" literature (Almond & Verba 1989 is the classic reference here), po- 
litical culture meant political attitudes studied quantitatively [with rare excep- 
tions such as Banfield’s (1967) ethnographic account of political stagnation in a 
southern Italian village]. The assumption was that attitudes directly determined 
political practices. Laitin and Eckstein present the clearest challenge to that ap- 
proach. Laitin calls for historical and ethnographic accounts of politics that view 
culture as “points of concern” where symbols and language are appropriated 
instrumentally to achieve cultural and political ends. Eckstein, invoking Talcott 
Parson’s action theory, argues that culture affects political transformation by 
determining the context in which social actors make collective and individual 
political decisions. Wilson (1992) develops an institutional theory of political 
culture. 

Social scientists have not completely abandoned the study of political atti- 
tudes. For example, Inglehart (1990) contends that since the mid-1970s, what 
he terms “post-materialist’ values have shaped political attitudes in Europe. 
Inglehart uses attitudinal data taken from the Eurobarometer survey to explain 
what others have called New Social Movements (Cohen 1985), that is, move- 
ments that focus on identities and values, e.g. women’s movements, anti-nuclear 
movements, and ecology movements. Brint (1994b) uses American survey data 
to explore the political behavior of middle class professionals. Szelenyi et al 
(1996) use results derived from demographic and public opinion survey data to 
explain the unexpected 1994 Hungarian election results in terms of the “strength 
of social democratic sentiment.” 

The new attitude studies, while more theoretically sophisticated and histori- 
cally embedded than their predecessors, exist more as methodological residues 
than as harbingers of new directions. The new literature on political culture is 
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theoretical in that it challenges the concept itself, and it is historical in its care- 
ful attention to context and decided focused upon ethnographic and historical 
methods. Much of the focus of this literature has been on issues of civil society 
and democratization—at the level of both theory and empirical analysis. 


Theories of Civil Society 

Before 1989, civil society theorists were primarily concerned with the revivifi- 
cation of democratic socialist political practice (Mouffe & LaClau 1985, Keane 
1988, Barber 1984). Political events and the resurgence of ethnic nationalism 
in the former Eastern Europe have forced a shift in focus. While there is a 
clear historical reason for sociological interest in civil society, the question re- 
mains as to why civil society is important and how this is linked to questions 
of democratization. 

Civil society is generally conceptualized as the social space in which a demo- 
cratic polity is enacted. What constitutes "social" ranges from every institution 
that is not market- or state-based to relatively restrictive conceptions that focus 
on family and what Alexis de Tocqueville called mediating social institutions. 
While the term originates with Hegel, Jurgen Habermas's Structural Transfor- 
mation of the Public Sphere (1989), particularly after its English translation in 
1989, has established (whether it is acknowledged or not) the terms of much 
of the debate. According to Habermas, what separates democratic practices 
from what has gone before is the existence of a separate sphere that is neither 
state nor church, where individuals form publics and debate political ideas and 
practices. Habermas's ideal of the public sphere is the eighteenth century cof- 
fee house, the salon, and the rise of a free press. His contribution is inherently 
cultural because, in contrast to other theories of democratization that simply 
link it to markets and the development of nation-states, Habermas argues for a 
communicative space in which democratic practices and values evolve. 

Ihe academic discussion of civil society begins with elaborations of 
Habermas's idea (Cohen & Arato 1992). Kumar & Weintraub's anthology 
(1996) underscores the importance of the division between public and private. 
Challenges to the idea of civil society include the charge that it is historically 
specific and lacks analytic power in view of contemporary realities (Kumar 
1993, Bryant 1993, Taylor 1990, Calhoun 19932). Seligman (1992) argues that 
the “idea” of civil society is a moral concept that originates in the Scottish en- 
lightenment. According to Seligman, a vibrant civil society such as that in the 
United States depends upon a political culture is which the boundaries between 
public and private are firmly held. For this reason, he is dubious about applying 
the concept to political developments in the former Eastern Europe. 

Somers (1995a,b) criticizes current discussions of politics and culture be- 
cause they do not take into account the historical specificity of the terms of 


366 BEREZIN 


debate. Focusing on the “old” political culture literature and Habermas’ dis- 
cussion of a public, she argues for a historical sociology of concept formation 
and locates the idea of Anglo-American citizenship theory in its cultural and 
historical milieu. Somers’ principal contribution lies in the argument that even 
the most taken-for-granted building block of modern democracy is highly con- 
textual and that we must be cautious about simply taking these terms up in toto 
and applying them to other temporal and spatial locales. In a series of articles 
(Alexander & Smith 1993, Alexander 1994, Alexander 1996), Alexander offers 
an innovative approach to civil society that introduces the conception of soli- 
darity. He argues that members of the polity experience belonging in a series 
of binary discourses in which they articulate issues of good and evil. Those 
discourses generate an important, but unstated, dimension of civil society—a 
feeling of membership and community. 

Empirical work on the public sphere is just beginning to emerge. The essays 
in Calhoun’s (1993d) anthology take up the issue of gender and cross-national 
variation of public sphere development. Somers (1993) examines divergent 
conceptions of citizenship in different parts of Britain in the prerevolutionary 
period. Her argument is that, rather than being an attribute of persons based 
upon rights (the Marshall distinction), citizenship is relational and is reflective 
of a diverse set of local institutional arrangements that give rise to different 
notions of a public sphere and political cultures. Building on earlier work 
(Zaret 1989), Zaret (1996) argues that much discussion of the public sphere 
has failed to look closely at the available forms of political communication. 
By examining the evolution of “petitions” in sixteenth century England, he ts 
able to locate the origins of the public sphere not in the development of the 
bourgeoisie and Enlightenment ideology but in the effect that printing had on 
the medieval practice of petitioning local authorities. This practice of political 
communication was open to all social classes, and Zaret argues that it reveals the 
populist roots of the public sphere and allows it a greater democratic potential 
than critics of modernity would suggest. Variations on this theme include 
studies of the socialist public sphere (Tucker 1996) and the cultural dimensions 
of democracy (Creppell 1989, Edles 1995). 


Democracy and Culture 


The debate about civil society and democracy has not remained at the purely 
theoretical level. Bellah and his colleagues, who would probably not label 
themselves civil society theorists, have provided a prominent empirical contri- 
bution to the debate about democracy in the American context. Bellah’s (1980) 
comparative study of “civil religion,” which evokes Rousseau’s conception of 
“republican virtue,’ calls for an active political community with citizen partic- 
ipation. Bellah's collaborative works, Habits of the Heart (Bellah et al 1985) 
and The Good Society (Bellah et al 1991), placed the idea of political culture 
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and democratic practice firmly on the American research agenda. Turning 
to Alexis de Tocqueville rather than contemporary continental social theory, 
Bellah and his colleagues interviewed Americans about their participation in 
the polity and how they viewed themselves as political actors and members 
of a political community. Both books call for a new public spiritedness based 
upon the revivification of community and public commitment, and both see a 
retreat to individualism as one of the features that has weakened contemporary 
American democracy. 

Lichterman (1995, 1996) conducted a field study on the local level of two 
American environmental protection groups, and he challenges Bellah’s conclu- 
sions. Rejecting what he terms the “seesaw model” (i.e. one is either a liberal or a 
communitarian), he argues that the retreat to "personalism" and self-fulfillment 
does not have to lead to a lack of public spiritedness. His study suggests that an 
ethic of personal fulfillment in some circumstance may contribute to a renewed 
sense of political commitment and a reinvigorated democratic community. 

If Bellah's work has been the linchpin of discussion of American politi- 
cal culture, Robert Putnam’s Making Democracy Work (1994) and the debate 
around it (for example, Laitin 1995, Tarrow 1996) has become a focal point for 
research on international political culture. It has also had a surprising influence 
on public debate in the United States, as evidenced by the three-page article 
that Putnam had in no less a venue than People Magazine! Putnam, a political 
scientist, studied regional government in Italy over a 20-year period. Despite 
the apparently limited empirical focus of his work, its influence has extended 
well beyond the exigencies of Italian political development. Putnam argues 
that the practices of democracy are based on the trust that a high degree of 
social capital bestows. He argues that the success or failure of self-governance 
in Italy was linked to the patterns of cooperation and trust that were established 
in early renaissance city states. He argues that a vital civic culture is linked 
to the experience of cooperation learned in voluntary organizations that range 
from mutual credit associations to choral societies. 

Calhoun’s Neither Gods Nor Emperors (1994b), which focuses upon the 
Chinese student movement and the events that led to the massacres in Tiananmen 
Square, discusses how informal student networks were able to mobilize dissent 
even in a controlled society such as China. Calhoun argues that the value of 
mobilization in creating new identities and the idea of democracy and public 
participation are important dimensions of civil society and transformative pol- 
itics even if the movements themselves fail. Calhoun’s owes his emphasis on 
identity and democratic practice to Taylor’s (1989) discussion of the relation 
between “ordinary life” and political practice (Calhoun 1991). 

Another strand of the debate on civil society is the emerging literature on 
multiculturalism, which scholars usually discuss in terms of group rights. In the 
United States, the focus is on difference and social equality (Taylor 1994, Gitlin 
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1993, Spencer 1994); whereas in the non-American context, multiculturalism 
tends to be associated with ethnic conflict (Joppke 1996, Kymlicka 1995). 
Multiculturalism relates to civil society in that it takes up the issue of who shall 
participate in the polity based upon cultural identity. 

Within the literature I have reviewed under the rubric of "political culture,” 
there is an underlying, although unstated, anthropological view of culture as 
shared mental grids or meanings that is part of the process of a political practice 
such as democracy. 


STATES AND INSTITUTIONS 


Civil society and democratization discussions assume a political culture but do 
not explicitly theorize it. In contrast, the literature that I group under states and 
institutions begins with problems traditionally falling within the purview of 
political sociology and organizations. Lying closer to my discussion of politics 
and culture, this literature views culture as a variable that enters into the process 
of institution formation and action. 

State theory traditionally looks to finance, military structure, and bureaucracy 
to define the state. Analysts with an interest in culture have begun to question 
the adequacy of these variables in defining state formation. Jasper (1987) 
attributes the divergent outcomes in nuclear energy policy in France, Sweden, 
and the United States to the policy styles of government planners. The study by 
Berezin (1991) of artistic production and censorship in Fascist Italy argues that 
theories of state action apply in nominatively totalitarian as well as democratic 
states. Using game theory and focusing on language policy in Catalonia and 
Africa, Laitin (1989, 1992, Laitin & Sole 1994) attributes the political and 
social cohesion of the modern state to its ability to establish language repertoires. 
Ikegami (1995) studies Japan in the Tokagowa period and argues that “honorific 
individualism,” carried by permutations in Samarai culture, was integral to the 
formation of the Japanese variant of the nation-state. Burke (1992) identifies 
the "fabrication" of Louis XIV as an example of a person becoming a symbol 
to consolidate central state power. 

Foucault's (1977a,b) conception of discipline and the social has influenced 
new directions in state theory. In a theoretical exegesis on the concept of the 
state, Mitchell (1991b) argues that the state is merely a vector of structural 
forces. He elaborates this argument in his study of the Egyptian colonial state 
(Mitchell 19912), where he argues that the “power to colonize” is the power 
to impose meanings in the form of British hegemonic discourse. Steinmetz 
(1993) turns to Foucault's conception of the "social" to explain the particular 
iterations of the German welfare state on the local level. Gorski (1993) argues 
that theories of state formation have failed to adequately theorize the relation 
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between Protestantism and Calvinism and the form of the modern state. Draw- 
ing on both Foucault’s notion of public discipline and Elias’s conception of 
personal discipline, Gorski argues that Protestantism extended the notion of 
self discipline to the obligation to discipline the entire community. The wed- 
ding of Protestantism’s disciplinary imperative to collective organization was 
as important as war and bureaucratic rationality for the formation of the modern 
state. Lustick (1993) in his comparative study of territorial dispute in the Gaza 
Strip, Algeria, and Northern Ireland uses a Gramscian notion of hegemony and 
position to argue for a culturalist theory of colonial warfare. 

An emerging strand of state and culture literature is one that incorporates the 
idea of gender. Verdery (1994) summarizes this most succinctly when she talks 
of the nation state as being a gendered concept, with nation representing the 
soft feminine side of the equation and state representing the hard bureaucratic 
masculine side. Scholars are beginning to recognize the gendered nature of the 
state in a range of areas. Prominent examples of this approach include Hunt’s 
(1992) discussion of the “family romance,” Connell’s discussion of state and 
masculinity (1995), Borneman’s (1992) equation of state and kinship, as well 
as works on the gendered nature of citizenship (Orloff 1993) and patriarchy 
(Adams 1994).4 

The growing appeal of the “new” institutionalism within sociology has pro- 
pelled culture to the forefront of the study of policy, law, and even markets. The 
work of Thomas, Meyer and collaborators (1987) and the influential Powell & 
DiMaggio anthology (1991) have set the research agenda in this area. Friedland 
& Alford’s (1991) injunction to “bring society back in” summarizes much of the 
cultural dimension of institutionalism as it underscores the importance of rules 
and repertoires of practices and symbols in social behavior and organizational 
action. For example, Skrentny’s (1996) analysis of the cultural contradictions 
of affirmative action pays attention to the institutional context within which this 
policy emerged, while arguing that the notion of context has to be broadened 
to include multiplicity of rules, shifting rhetorics, and audience expectations. 
Fligstein (1996) argues that the political dimension of markets cannot be ade- 
quately understood if we do not understand the deeper cultural structures that 
give rise to them. Dobbin (1994) challenges the received wisdom that economic 
efficiency determines industrial policy choices. In a study of railway policy in 
Britain, France, and the United States, Dobbin argues that the diverse political 
cultures of these states contributed to the choices they made regarding railroad 
development. He thus shows that institutions themselves are as cultural as more 
typically studied cultural objects such as symbols. 


^For an explicit discussion of the relation between state and culture, see Steinmetz’s 1997 
anthology, State/Culture 
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In a novel vein, Soysal (1994) uses institutional theory to explore the issue 
of citizenship. Focusing upon the incorporation of immigrants into post-World 
War II European national labor markets, she argues that a conception and prac- 
tice of citizenship that had been theorized in the language of national rights 
has been replaced in the contemporary era by post-national citizenship. She 
argues that the discourse of citizenship is now a discourse of human rights and 
that this 1s carried by diverse international agencies, particularly the United 
Nations. This language of humanity and personhood institutionally buttressed 
by fluid national labor markets and international governing agencies decouples 
citizenship from territory. 

Most theorizing on citizenship does not go so far as Soysal to make a cultural 
argument for its transformation from its nineteenth century form into some- 
thing arguably postmodern. Brubaker (1992) focuses upon immigration policy 
in France and Germany and asks why two countries facing similar influxes of 
immigrants have not changed their policies to accommodate changing reali- 
ties. He argues, borrowing from Pierre Bourdieu, that citizenship is a boundary 
setting device that limits who is and who is not a member of the polity and 
that divergent notions of incorporation in France and Germany reflect the dif- 
ferent political cultures in which the idea of nation was conceived during the 
nineteenth-century period of state formation. 

Scholars who label themselves as institutionalists tend to focus on law and 
markets—standard political and economic institutions; students of religion, a 
potent cultural institution, are also contributing to these debates (Williams & 
Demerath 1992). The intersection of religion and organization figures promi- 
nently in several studies of culture and politics. David Laitin (1986) studies how 
Muslim and Christian culture and religious practices contributed to different 
political outcomes in different parts of Nigeria. Casanova (1994) links religion 
to politics by identifying what he terms public and private religions. Wuthnow 
(1989) studies the discourse and diffusion of Protestantism and its relation to 
other forms of political ideology. Zaret (1985) in his study of covenant theology 
rewrites Walzer (1965) by pointing to the role of organization in the diffusion 
of Protestant ideology. A somewhat related study is Kertzer’s Comrades and 
Christians (1980), which analyzes how the Italian communist party appropri- 
ated Roman Catholic symbols and institutions to broaden its constituency in 
the 1970s. 

A renewed focus on nationalism has developed in parallel to the interest 
in institutions, states, and culture. Anderson’s (1991) Imagined Communi- 
ties, which points to the constructed nature of nation/state coupled with the 
contemporary reemergence of nationalist movements, has fueled this research 
agenda (see for example, Calhoun 1995b). The literature on nationalism 1s vo- 
luminous and growing (Calhoun 1993c). The central debates in this literature 
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concern whether nationalism represents a retreat to primordial sentiments fu- 
eled by class resentment (Greenfeld 1992), an instance of nation/state building 
(Sahlins 1989, Medrano 1995, Smith 1995), or a form of ideological imperial- 
ism (Chatterjee 1993, Bhabha 1990). National identity, and identity in general, 
had been the focus of much theorizing (AD Smith 1991, 1996, Gellner 1987, 
Somers 1994, Somers & Gibson 1994, Calhoun 1994a, 1995a, Handler 1994) 
with emerging empirical contributions (McDonaugh 1995, Meadwell 1990, 
Eyal 1996, Phillips 1996). Verdery’s (1991) study of Rumania, using a frame- 
work borrowed from Foucault and Bourdieu, explores identity formation under 
conditions of totalitarianism. Ringmar (1996) is one of the first social analysts 
to offer a “cultural explanation” of a historical event, Sweden’s entry into the 
Thirty Years War, that draws on theories of identity and nationhood. Brubaker 
(1996) offers an innovative approach to the study of nationalism, which ties 
together the culturalist and institutionalist versions of national problematizing. 
Focusing on the former Soviet Union, Brubaker argues that nationalist projects 
are best understood as sets of practices that get institutionalized under certain 
conditions. 


POLITICAL COMMUNICATION AND MEANING 


Culture functions as an exogenous variable in institutional and state-oriented 
studies whose principal purpose is to explain some organizational or policy 
outcome. Political communication and meaning is the largest and growing 
sub-area within politics and culture. Cultural practices such as rituals and sym- 
bols, broadly defined to include national constitutions and histories as well as 
more standard aesthetic objects such as art and literature, provide the empirical 
focus for analysis of the dissemination of knowledge about politics. Sociolog- 
ical contributions attempt to theorize the autonomous, or sometimes relatively 
autonomous, status of culture. 

Political communication and meaning is the most theoretically and empir- 
ically eclectic sub-area of politics and culture at the moment. The range of 
work and the objects of study vary as well as the literature cited. In focusing 
on the political meaning of cultural symbols, objects, and practices, the focus 
shifts from the production of meaning to the reception of meaning (Brown 
1993, Cohen 1989, Goldberg 1991, Breckenridge 1989, Gambetta 1991). Pro- 
duction analyses, such as institutionalism, tell plausible stories about how cul- 
ture constrains and enhances the actions of political actors and organizations. 
The studies that I now consider focus on how political audiences—ordinary 
citizens—Trespond to cultural politics. Although response is the key issue in this 
body of work, that does not suggest that practitioners have solved the problem 
of impact. One way to obtain this information is to ask citizens what they think 
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and feel about politics, as did Lane (1962) 1n his study of lower-middle-class 
men in New Haven, and Gamson (1992) in his focus group study of contem- 
porary Ámerican politics. Eliasoph (1990, 1996) uses a snowball sample and 
asks men and women “on the street" what they thought of the Iran-Contra hear- 
ings; in later work she explored the issue of political apathy by listening to 
the language of ordinary citizens. In both studies, Eliasoph finds that people 
interpret politics in ways that are counterintuitive to the manner in which we 
study politics and to the way politicians act out politics. However, despite 
these exceptions, much of the literature that I now review has to deal with what 
anthropologist Verdery (1991) in her study of Ceausescu's Romania calls the 
“Is anyone listening problem?" 


Politics and Cultural Objects 


Griswold's (1987) definition of "cultural object" as "significance embodied 
in form" is a useful label for the range of entities that this section addresses. 
With politics lagging behind sociology and history, Edelman (1995) has written 
a monograph signaling the importance of art to politics. Within sociology, 
Goldfarb (1980, 1982) was one of the earliest contributors to this literature 
when, before 1989, he studied Polish student theater as a vehicle of dissent and 
the different enactments of cultural freedom in the United States and Poland. 
Haraszti (1987) studied how Hungarian writers under state socialism managed 
to produce in spite of state censorship. Corse (1995, 1997) has asked how 
literary canons are created in the service of national identities by comparing 
novel production in the United States and Canada. Beisel (1993) shows that 
art censorship is more a product of the class position of the patrons than of any 
objective qualities of pure and impure art. 

The relationship between form, content, and meaning is an emerging theme 
in studies of cultural politics. Burns's (1992) study of the ideological pro- 
nouncements of Catholic bishops and the Comaroffs' (1991) study of religious 
conversion in South Africa argue that 1t was the rhetorical strategies, rather than 
content, that were mobilized in the service of ideological persuasion. Berezin 
(1994b) asks how fascist theater conveyed fascist meaning if theatrical content 
appeared not to contain fascist ideology. She argues that it was the formal 
properties of theater, staging and style, that communicated fascist meaning. 
Mukerji (1994) argues that the design of seventeenth-century French formal 
gardens was the aesthetic parallel of the geopolitical maneuvering of a state 
bent on territorial aggrandizement. In a similar vein, Cerulo's (1995) study 
of anthems and flags as national symbols argues that it is the syntax or the 
symbolic structure, rather than explicit content, that conveys meaning. 

À more textualized approach to the study of art and politics is found in 
Bowler's (1991) and Falasca-Zamponi's (1995) studies of Italian futurism and 
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fascism. Both articles use texts to explore the relation between an aesthetic 
and political movement. Wagner-Pacifici’s (1986) study of the kidnapping and 
murder of Italian prime minister Aldo Moro borrows Victor Turner’s notion 
of liminality to discuss this event as a dramatic genre. Her study of an urban 
bombing in Philadelphia (1994) continues this type of analysis by focusing on 
strategic discourse. 

Cultural objects are more than simply aesthetic. Scholars are beginning to 
look to constitutions as national narratives (Arjomand 1992, Hart & Stimson 
1993, Norton 1992). Spillman (1996) examines constitutions as vehicles of na- 
tional identity by comparing debate at constitutional conventions in the United 
States and Australia. History as cultural object is the subject of work on memory 
and politics (Halbwachs 1980, Connerton 1989). Beginning with the influential 
anthology by Hobsbawm & Ranger (1983) on the “invention of tradition,” the 
constructed past has become a fertile subfield of political and cultural analysis. 
Much of this work has been on material objects of memory such as monuments 
and museums (Gillis 1994), with exceptions (Golden 1988). Schwartz (1991, 
1996) and Fine (1996) have contributed a range of studies on the historic re- 

‘construction of American Presidents. Schudson’s (1994) study of Watergate 
and the study by Schwartz and Wagner-Pacifici (1991) of the Vietnam War 
Memorial in Washington, DC, have discussed how a political event of dubious 
significance shaped American political commitment and confidence. The study 
of past events that nation-states would prefer to forget has been the object of 
much analysis that focuses upon Europe. Studies of the Nazi Holocaust (Maier 
1988, Friedlander 1992), French collaboration (Rousso 1991), and the fall of 
Italian fascism (Berezin 1996) have focused upon how the memory of a morally 
questionable past affects current political practices. 


Ritual and Symbolic Processes 


Students of ritual, broadly conceived to include repeated expressive and sym- 
bolic actions in public spaces, have contributed much to the study of politics. 
Here, as in the study of political culture, we can identify an older and a newer 
approach. Edelman (1985, 1988) was among the first in social science to argue 
that there was an important symbolic dimension to political action. Within 
anthropology, Geertz’s work (1980) on the Negara as theater state and the an- 
thropology of power (1983) lent this mode of analysis legitimacy and elegance. 
Geertz’s work is seamless—its virtue and its limitation (for example, Cannadine 
1987, Ozouf 1989, Von Geldern 1993, Davis 1975, Lane 1981). 

The new studies of ritual drvide in two ways. Those that treat it as a process 
of the production of belief, ideology, or identity (Kertzer 1988) and those that 
look at itas a narrative of domination (Scott 1990). Kertzer’s (1996) study of the 
reinvention of the Italian Communist party looks to party history and national 
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politics to explain shifts in the meaning of the naming of the party. Berezin 
(1997a) examines Italian fascist ritual and reveals that the fascist identity project 
was an attempt to rewrite the rules of cultural game through the appropriation 
of the Italian cultural values of family and the popular practices of Roman 
Catholicism. Fascist ritual was a vehicle of subversion and solidarity, and 
sometimes both simultaneously, but never a narrative of identity. Laba’s (1991) 
discussion of the "sacred politics” of the Polish Solidarity movement, Burawoy 
& Luckacs’s (1992) description of “painting socialism,’ Gal’s (1991) discussion 
of the reburying of Bartok’s body in Hungary, and Gilmore’s (1993) study of 
Andalusian carnival pursue a similar line of analysis. These works start with 
a historical problem, fascism, communism, or socialism, and look to ritual to 
link cultural conditions to ideology and political practice. They pay careful 
attention to historical context, and context is as important as any single event 
that they seek to describe. 

In contrast to this productivist approach to ritual, there is an evolving and 
important Durkheimian strand of analysis (Alexander 1988a,b, Rothenbuhler 
1988, Tiryakian 1988). Lukes (1975) has criticized the unreconstructed 
Durkheimian position that sees ritual as purely a dramatization of the social 
order. Moore & Meyerhoff (1977) attacked the codification of politics into 
sacred and profane. The new Durkheimianism emphasizes the complexity of 
ritual analysis and in some instances incorporates Foucault. This approach 
starts with the ritual events themselves and works backward from that point 
(P Smith 1991, 1996, Ben-Amos & Ben-Ari 1995). This work combines a 
Durkheimian idea of ritual with Foucauldian notion of power and domination. 
Ritual becomes an expressive vehicle of the power to discipline and punish or 
to describe or inscribe historical events and meanings. 


COLLECTIVE ACTION 


The study of collective action, which had been dominated by structural ap- 
proaches, is taking a decidedly culturalist turn. New work on the study of 
revolutions and social movements attempts to understand the role that culture 
plays in political actions. As in the institutions literature, culture is a new 
preoccupation for many students of social movements. 

Within both history and sociology, Hunt’s Politics, Culture and Class in 
the French Revolution (1984) and Sewell’s Work and Revolution in France 
(1980) legitimated the importance of culture for the study of revolutions and 
momentous political events. However, for the past 18 years, Skocpol's States 
and Social Revolutions (1979) has set the research agenda within sociology. 
Skocpol (1985) and Sewell (1985) had a frequently cited debate on the role of 
ideology in social revolutions. Sewell called for an integrative approach to the 
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study of culture and politics; and Skocpol replied that “cultural idioms” invoked 
at particular points in time were “probably” useful in enhancing the study of 
political outcomes. Given Skocpol’s leading role in the study of revolutions 
for the past 18 years, it is not surprising that two of the spokespersons for 
a more cultural account of revolution are her students, Goldstone (1991a,b) 
and Goodwin (1994). Goldstone and Goodwin began as structuralists, so they 
argue that culture must be incorporated as a form of structural variable into 
cultural analysis. Other persons working in this area began with a primary 
interest in culture, and they tend to focus upon the role of ideological discourse 
in revolutions (Moaddel 1992b, Sewell 1994, Burns 1996). 

Social movements have always attracted scholars who were attentive to the 
role of culture and collective mobilization (Gusfield 1963, Klatch 1987). For 
example, Rieder (1985) found that ethnicity and the cultural visions it gener- 
ated contributed to the different mobilization patterns among members of the 
Jewish and Italian lower middle class community that he studied. An influ- 
ential volume of Social Research (Volume 52, 1985) devoted to new social 
movements (Cohen 1985) and an article by Snow et al (1986) that introduced 
the concept of “framing” initiated the cultural turn in social movement theory 
and research. The term “new social movements” was coined to label the move- 
ments that emerged in the post-student movement period in Europe (roughly 
1969 to 1977); these were about issues of identity (women’s and gay rights) 
and values (peace, environment). Although Calhoun (1993b) has claimed that 
these movements did have historical precedents, the label stuck as a way to deal 
with movements that were not based on claims for material advantage (such as 
labor movements, or even voting rights movements). The interest in new social 
movements introduced culture or cultural claims to the study of social move- 
ments (Beisel 1990, Jasper & Nelkin 1992, Eder 1993, Eyerman & Jamison 
1991, Moaddel 1992a, Melucci 1996). 

Snow's work (Snow et al 1986, 1988, 1992) as well as that of his associates 
(Benford & Hunt 1992, Guthrie 1995) captured the imagination of scholars 
who had traditionally worked in the field of social movements from a decidedly 
nonculturalist perspective (Tarrow 1992, 1994, Gamson 1988, 1991, Gamson 
& Modigliani 1989, Gamson & Stuart 1992). In Snow's original formulation, 
a frame was a bounded and easily recognizable narrative package that activists 
could impose on events. The more successful a movement was, the more likely 
it was able to tap into these deeply resonant preset frameworks (Zald 1996, 
Johnston et al 1994, McAdam 1994, 1996). 'The problem with frame analysis 
is that while its boundedness appeals to those who started out as structuralists, 
it is overly rigid to persons who have a more fine-grained sense of cultural and 
historical analysis. An interest in rhetorical strategies or language (Blain 1994, 
Ellingson 1995) and political mobilization is beginning to emerge and will be 
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a major competitive paradigm to frame analysis. The critique by Emirbayer & 
Goodwin (1994) of network theory introduced a new pathway linking culture 
to social movements that has yet to be fully explored. 


THE ROAD AHEAD 


In general, the terms, “politics” and "culture" are somewhat underspecified and 
have different meanings for persons who label themselves as working in the 
field. Swidler (1986), who wrote possibly the most cited article in this entire 
area, and Sewell (1992) have exercised a distinct theoretical influence on recent 
work—particularly work with a historical orientation. Bourdieu's emphasis 
on boundary making (Lamont 1992, Lamont & Fournier 1993), Foucault's 
disciplinary mechanisms, and Habermas' conception of the public are setting 
the research agenda of scholars who focus on macro-level social change. The 
interest in politics and culture is driven as much by current history as by theory. 
The events of the last eight years in the former Eastern Europe and Soviet 
Union, as well as the concern with the weakening of political engagement and 
growing social inequality in the United States, together are fueling research in 
this area. 

Politics and culture has always been an interdisciplinary area. Promising 
directions for future work derive from theory, history, and anthropology, and 
these are already to some extent visible in the works that I have discussed. First, 
historical work with an ethnographic bent offers a promising method for mining 
the past to understand the present. Second, political communication—how 
citizens actually view themselves as political subjects coupled with a closer 
scrutiny of the public vehicles that disseminate political ideas—needs more 
attention. We know very little of how people think about politics and the 
language in which they express "ordinary" political beliefs. We also know 
very little about how the “public sphere” operates to shape ideas and practices. 
“Talking” about politics is an important focus of future work. Third, the field 
of social movements as it moves away from interest models and extends frame 
analysis offers a promising area to recast political mobilization in cultural terms. 
The rethinking of this area points us in the direction of emotion and politics 
(Goodwin 1997, Berezin 1997b)—to date, a neglected area that promises to 
bring social psychology and gender into the sociological study of politics and 
to refine and extend current discussion of identity. 

The terrain that I have mapped in this review is somewhat less fissured than 
the one I described three years ago. While it is easier to identify sub-areas 
of interest, paradigms have not yet crystalized, and there is still, as evidenced 
by the variety of works cited, a great deal of empirical eclecticism. The field 
is open to new ideas, new theoretical approaches, and new research methods. 
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Politics and culture is still a frontier, and frontiers by their nature are sites 
of innovation. The increasing numbers of researchers working in this area 
suggest that the next review of this area will reveal clear paths—most likely 
forged through the trends that I have identified. 
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ABSTRACT 

The study of identity forms a critical cornerstone within modern sociological 
thought. Introduced by the works of Cooley and Mead, identity studies have 
evolved and grown central to current sociological discourse. Microsociological 
perspectives dominated work published through the 1970s. Sociologists focused 
primarily on the formation of the “me,” exploring the ways in which interpersonal 
interactions mold an individual's sense of self. Recent literature constitutes an 
antithesis to such concerns. Many works refocus attention from the individual to 
the collective; others prioritize discourse over the systematic scrutiny of behavior; 
some researchers approach identity as a source of mobilization rather than a 
product of it; and the analysis of virtual identities now competes with research 
on identities established in the copresent world. This essay explores all such 
apenda as raised in key works published since 1980. I close with a look toward 
the future, suggesting trajectories aimed at synthesizing traditional and current 
concerns. 


INTRODUCTION 


The study of identity forms a critical cornerstone within modern sociologi- 
cal thought. Introduced by the works of Cooley and Mead, identity studies 
have evolved and grown central to current sociological discourse. Microsocio- 
logical perspectives (social psychology, symbolic interactionism), perspectives 
focused primarily on the individual, dominated work published through the 
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1970s.! Sociologists focused primarily on the formation of the “me,” exploring 
the ways in which interpersonal interactions mold an individual’s sense of self. 
But identity research of the past two decades proves antithetical to traditional 
concerns, a shift largely fueled by three important trends. 


1. Social and nationalist movements of the past three decades have shifted 
scholarly attention to issues of group agency and political action. Asa 
result, identity studies have been relocated to the site of the collective, with 
gender/sexuality, race/ethnicity, and class forming the “holy trinity” of the 
discursive field (Appiah & Gates 1995:1). Writings attend, in particular, to 
that which constitutes a collective and the political implications that result 
from collective definitions. 


2. Intellectual concerns with agency and self-direction have re-energized the 
study of identification processes. At the level of the collective, scholars are 
examining the mechanics by which distinctions are created, maintained, and 
changed. 


3. New communication technologies have freed interaction from the require- 
ments of physical copresence; these technologies have expanded the array 
of generalized others contributing to the construction of the self. Several 
research foci emerge from this development: the substance of "T," “me,” and 
the generalized other in a milieu void of place, the establishment of “com- 
munities of the mind," and the negotiation of copresent and cyberspace 
identities. 


This essay explores each of these research agenda. Because of the literature's 
expanse, I limit discussion to key works published since 1980. My review 
includes several nonsociological works, a strategy demanded by the makeup of 
this field. (I revisit this issue in the conciusion.) The essay closes with a look 
toward the future, as I suggest trajectories aimed at synthesizing traditional and 
current concerns. 


THE "NATURE" OF COLLECTIVE IDENTITY 


Collective identity is a concept grounded in classic sociological constructs: 
Durkheim's “collective conscience,’ Marx's "class consciousness,’ Weber's 
Verstehen, and Tonnies’ Gemeinschaft. So rooted, the notion addresses the 
“we-ness” of a group, stressing the similarities or shared attributes around 
which group members coalesce. Early literature approached these attributes 
as “natural” or "essential" characteristics—qualities emerging from physiolog- 
ical traits, psychological predispositions, regional features, or the properties 


! For recent directions 1n this literature, see Stryker (1992). 
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of structural locations. A collective’s members were believed to internalize 
these qualities, suggesting a unified, singular social experience, a single canvas 
against which social actors constructed a sense of self. 

Recent treatments of collective identity question the essentialism of collective 
attributes and images. Anti-essentialist inquiries promote the social construc- 
tion of identity as a more viable basis of the collective self. Other works stress 
the problems inherent in collective categorization, presenting a postmodern 
challenge to arguments of unified group experiences.” 


Social Constructionism: The Anti-Essentialist View 


In concert with theories of WI Thomas, Peter Berger, Erving Goffman, Howard 
Becker, and others, the social constructionist approach to identity rejects any 
category that sets forward essential or core features as the unique property of a 
collective’s members. From this perspective, every collective becomes a social 
artifact—an entity molded, refabricated, and mobilized in accord with reigning 
cultural scripts and centers of power. 

Social constructionism informs much of the work on gender identity. Such 
studies challenge essentialist dichotomies of gender and dismiss notions of 
gender’s primordial roots. Constructionists conceptualize gender as an interac- 
tional accomplishment, an identity continually renegotiated via linguistic ex- 
change and social performance. From this stance, researchers also explore sub- 
jective definitions of femininity and masculinity, attending to the symbols and 
norms that initiate and sustain either/or classifications (e.g. Bem 1993, Connell 
1995, Coser 1986, C Epstein 1988, Fausto-Sterling 1985, Gailey 1987, Gerson 
& Peiss 1985, Hearn 1992, Kupers 1993, Margolis 1985, Marshall 1991, Probyn 
1993, Richardson 1989, D Smith 1987, 1990, West & Zimmerman 1987, 
Wittenstrom 1995, Wittig 1981, 1986). Constructionist works often scrutinize 
agents of socialization, delineating their role in gender identity acquisition: the 
family (e.g. Caldera et al 1989, MacDonald & Parke 1986, Ross & Taylor 1989, 
Whiting & Edwards 1988), the schools (e.g. Best 1983, Cookson & Persell 
1985, AJ Davis 1984, Eder 1995, Gilligan 1990, Hyde & Linn 1988, Hyde 
et al 1990, Raissiguier 1994, Sadker & Sadker 1994, Thorne 1995, Trepanier 
& Romatowski 1985), popular culture and media (e.g. Atkin 1982, Eilberg- 
Schwartz & Doniger 1995, Gaines 1991, Kalisch & Kalisch 1984, Kaplan 1983, 
CL Miller 1987, Sidel 1991, Signorielli 1990, Signorielli & Morgan 1988). 
While such studies reflect numerous intellectual traditions, all are concerned 
with the ways in which socialization agents organize and project the affective, 
cognitive, and behavioral data individuals use to form a gendered self. 

An important outgrowth of gender constructionism rests with works that 
problematize the gender-sex link. Researchers dissect the differences in male 
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and female biology—the body and reproductive system in particular. By ques- 
tioning the meaning of biological distinctions, scholars expose the social rituals, 
symbols, and practices that transform such differences into social facts. In so 
doing, this literature demonstrates the inscription of gender on the body, si- 
multaneously dismantling notions of gender as emergent from the body (e.g. 
Acker 1989, Arditi et al 1985, Bartky 1988, Bordo 1993, Corea 1985, Martin 
1987, Medicine 1983, Nanda 1990, Oakley 1984, Papanek 1990, Sault 1994, 
Shilling 1993, Spallone & Steinberg 1987, Stacey & Thorne 1985). In high- 
lighting the subjective nature of gender, constructionists do not deemphasize 
the effects of gender categories. Rather, they argue that socially defined male- 
ness and femaleness severely constrict human behavior. Subjective definitions 
imprison individuals in spheres of prescribed action and expectation. Dorothy 
Smith writes of this effect within scholarship, dubbing it “the alienation of utter- 
ance... models of speaking, writing, and thinking that took (women's) powers 
of expression away from us even as we used them" (1990:199—201). Gender 
scripting attitudes, behaviors, emotions, and language, and treating these scripts 
as natural signals, ensures that social members both succumb to and recreate 
the “armor” of gender identity stereotypes. 

Constructionist approaches to sexual identity complement the gender liter- 
ature. Important entries include the work of Verta Taylor and Nancy Whittier 
(1992) on lesbian identity and lesbian social movements. Taylor & Whittier 
map a three-step process, itemizing (a) the construction of boundaries that both 
insulate and differentiate nonmainstream groups from the dominant sexual col- 
lective, (b) the emergence of shared consciousness and goals among nonmain- 
stream sexual groups, and (c) processes of politicization that valorize a group's 
minority status. Janice Irving (1994) takes a similar approach to adolescent sex- 
ual identity. Irvine identifies nine “axes of constructed domains" (p. 11) that 
contribute to identity building and identity-based experience: gender relations, 
sexual identities, reproductive strategies and behavior, sexual language and 
public discourse, the role of the family, nonreproductive sexuality, the purpose 
of sex and the role of pleasure, knowledge and meaning of the body, and sexual 
violence. Irving also explores the cultural differences that can color sexual 
identification as it occurs in varying social locations. These works exemplify 
a much broader literature exploring the origins, meaning, and renegotiation 
of sexual communities (e.g. D'Emilio 1983, Faderman 1981, Faraday 1981, 
Plummer 1981, Raymond 1994, E Stein 1992b, Troiden 1988, Whatley 1994). 

Race and ethnic studies represent another stronghold of constructionism. 
F. James Davis (1991), for example, provides a fascinating historical excursion 
that charts definitions of blackness in America. Davis documents the history 
of the "one drop rule,” a vehicle for racial classification. He unpacks the rule's 
development and highlights its triumph over competing alternatives. Davis 
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follows the one drop rule in action, itemizing its role in the struggle to main- 
tain the slave system. He also contrasts the US classification experience with 
racial categorization in other nations. Davis concludes with thoughtful specu- 
lation regarding the one drop rule's impact on the future of US race relations— 
particularly in light of developing demographic shifts. In another arena, Balibar 
& Wallerstein (1991) view racial identity within a broad analytical landscape, 
considering race in conjunction with nation and class. Blending constructionist 
premises with the socioeconomic lenses for which the authors are renowned, 
Balibar & Wallerstein thoughtfully analyze both imposed racialization and self- 
racialization, variantly considering racial identity and collective repression, the 
struggle for collective autonomy, and the search for collective shelter. 

Several works on racial and ethnic identity incorporate the subject's voice into 
their inquiries. Examples include Richard Alba's (1990) work on European- 
descended Americans. In keeping with constructionist premises, Alba argues 
that ethnic identity is no longer anchored in strongly ethnic social structures. 
Rather, he presents ethnicity as a symbolic entity “concerned with the sym- 
bols of ethnic cultures rather than with the cultures themselves” (1990:306). 
Alba argues that symbolic ethnicities are easily reshaped in response to vary- 
ing situational contexts and growing social needs. His data suggest one such 
reconstruction—a renegotiation that unites European descendents under the 
broad umbrella of a European-American identity. Alba argues that this identity 
shift bears significant social benefits for those it encompasses; the shift pro- 
vides white-Euro descendents with a larger, more comfortable base as they face 
a rapid influx of non-Euro, nonwhite immigrants. 

Identity shifts and their implications are also central to Mary Waters’s (1990) 
research. Like Alba, Waters brings forward a constructed, symbolic ethnicity. 
However, her work problematizes the relentlessness with which individuals 
cling to ethnicity. Waters scrutinizes ethnic identification in light of its social 
payoff—Trewards, she argues, that prove negligible for white, Euro-descendents 
and potentially negative for Americans of nonwhite, non-Euro lineage. Ulti- 
mately, Waters comes to understand ethnic identity as the product of personal 
choice—a social category individuals actively decide to adopt or stress. Her re- 
search documents the ways in which those of mixed ancestry switch and amend 
their primary ethnic affiliations. In this way, Waters locates the attraction of 
ethnicity within a double-edged American value. Commitment to ethnic iden- 
tity stems from a culturally based need for community—community lacking 
individual cost. 

In another arena, Joann Nagel (1995) examines ethnic identity shifts as a 
sociopolitical phenomenon. Using US Census figures (1960-1990), Nagel 
documents changing patterns of Native American identification. She explains 
identity shifts with reference to three sociopolitical factors: changing federal 
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Indian policy, increased American ethnic politics, and growing American Indian 
political activism. Nagel argues that these factors raised Native American ethnic 
consciousness, and she traces the ways in which policy and politics encouraged 
an ethnic renewal. (Also see Blakely 1993, Conzen et al 1992, DeVos 1992, 
Dyson 1993, Farley 1991, Frankenberg 1993, Hout & Goldstein 1994, Ignatiev 
1995, Jewell 1993, Shively 1992, Smedley 1993, Wade 1993, Williams 1990.) 

Social constructionism drives a multifaceted literature on national identity. 
A rich collection of sociohistorical works on commemoration, narrative, and 
symbolization chart the ways in which actors, particularly elites, create, ma- 
nipulate, or dismantle the identities of nations, citizenships, allies, and enemies 
(e.g. Agulhon 1981, Beaune 1991, Berezin 1997, Brubaker 1992, Corse 1996, 
Fine 1996, Gillis 1994, Griswold 1992, Hobsbaum 1992, Hobsbaum & Ranger 
1983, Kubik 1994, Lane 1981, Schudson 1992, Schwartz 1987, 1991, Spill- 
man 1997, Wagner-Pacifici & Schwartz 1991, Y Zerubavel 1995). In related 
work, several studies grapple with constructionist issues as they consider the 
constitution of the American self (e.g. Bellah et al 1985, 1991, Hewitt 1989, 
Meyer 1987). One also finds innovative elaborations of constructionism, such 
as Benedict Anderson’s (1991) work on imagined community. Anderson ap- 
proaches national identity as a sociocognitive construct—one both spatially 
and temporally inclusive, both enabled and shaped by broader social forces. 
He documents key moments of identity construction, times during which cul- 
tural (language) and social factors (capitalism, print technology) convene ın 
a particular historical moment, effectively remaking collective images of the 
national self (also see Bloom 1990). 

Yet, more than any arena before the identity scholar's eye, national iden- 
tity work presents a multivoiced excursion. Works probing nationalism with 
reference to the state and world markets continue to thrive (e.g. Armstrong 
1982, Gellner 1983, Giddens 1984, Tilly 1990). "New institutionalism" brings 
middle-range questions to bear, examining political structures and organiza- 
tional principles and their influence on policy, political agenda, and ultimately 
collective self-definition (e.g. Birnbaum 1988, Boli 1987, 1989, Jepperson & 
Meyer 1991, Skocpol 1985). Newer trends include Yasemin Soysal's (1994) 
“postnational model,” which addresses changing definitions of national mem- 
bership. Soysal examines the different strategies by which Western European 
nations incorporate guestworkers into the national citizenry. She argues that 
citizen collectives increasingly are defined not by their primordial ties to a terri- 
tory, but according to entitlements emerging from both a transnational discourse 
and a set of structures celebrating human rights (also see Shapiro & Alker 1995). 

Anthony D Smith (1991) poses perhaps the greatest challenge to construc- 
tionism. Smith adopts a middle-ground approach to national identity, linking 
social constructionism to more essentialist views. He defines national identity 
as a product of both "natural" continuity and conscious manipulation. Natural 
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continuity emerges from pre-existing ethnic identity and community; conscious 
manipulation is achieved via commemoration, ideology, and symbolism. Smith 
compliments this duality with a social psychological dimension, citing a "need 
for community" as integral to identity work. In Smith's view, this tri-part com- 
bination distinguishes national identity, making it the most fundamental and 
inclusive of collective identities. (Complimentary positions include Connor 
1990, Greenfield 1992, Hutchinson 1987. Calhoun 1993 and Hutchinson & 
Smith 1994 offer extensive literature reviews.) 


Postmodernism: Deconstructing Categories 


While supporting the antiessentialism that drives constructionist inquiries on 
identity, postmodernists cite serious flaws in the school's approach. Some find 
constructionism's agenda insufficient, suggesting that it simply catalogues the 
identity construction process. Further, many contend that the constructionist ap- 
proach implies identity categories built through interactive effort. Such a stance 
underemphasizes the role of power in the classification process (e.g. Connell 
1987, Gilman 1985), mistakenly suggesting “a multidirectional flow of influ- 
ence and agency” (Calhoun 1995:199). These weaknesses leave postmodern 
identity theorists skeptical of social constructionism's trajectory, fearing that 
the paradigm ultimately approximates the very essentialism it fights against. 
Diane Fuss elaborates in evaluating the constructionist approach to gender: 
"specifying more precisely these subcategories of ‘woman’ does not neces- 
sarily preclude essentialism. ‘French bourgeois woman’ or ‘Anglo-American 
lesbian, while crucially emphasizing in their very specificity that ‘woman’ is 
by no means a monolithic category, nonetheless reinscribe an essentialist logic 
at the very level of historicism" (Fuss 1989:20). 

In an effort to broaden the social constructionist agenda, postmodernists ex- 
amine the “real, present day political and other reasons why essentialist identi- 
ties continue to be invoked and often deeply felt" (Calhoun 1995:199). Further, 
in the study of identity, they view the variation within identity categories—.e. 
women, African-Americans, working class—as important as the variation be- 
tween identity categories. Finally, postmodernists advocate a shift in analytic 
focus, deemphasizing observation and deduction and elevating concerns with 
public discourse. In the spirit of Jean Baudrillard, Jacques Derrida, Michel 
Foucault, and Jean-Francois Lyotard, the postmodern-identity scholar decon- 
structs established identity categories and their accompanying rhetoric in an 
effort to explore the full range of "being." Works in this tradition call into ques- 
tion models that equate discourse with truth; they expose the ways in which 
discourse objectified as truth both forms and sustains collective definitions, 
social arrangements, and hierarchies of power. 

For students of identity, pcstmodern works on gender and sexuality prove 
richest. Judith Butler (1990), Patricia Hill Collins (1991), Jane Flax (1990), 
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Marjorie Garber (1992), Donna Haraway (1991), bell hooks (1984), and Trinh 
T Minh-Ha (1989) reconsider gender identity, giving voice to women of color, 
those of various social classes, and lesbian and bisexual women. They expose 
the dangers in approaching gender collectives as homogeneous entities and urge 
careful consideration of the complex, often contradictory, nature of collective 
existence. In contrast to the social constructionist, postmodern gender theorists 
challenge the dualistic, oppositional nature by which gender is traditionally 
framed. Patricia Hill Collins (1991), for example, notes that elements such 
as race and social class produce multiple variations of “women” and “men,” 
distinctions that many societies use to build complex hierarchical stratification 
systems. The existence of these multiple categories alerts us to the flaws of 
binary gender conceptualizations, focusing us instead on the ways in which 
multiple identity affiliations qualitatively change the nature of human experi- 
ence (see also Agger 1993, Baca Zinn & Thornton Dill 1994, Fraser 1989, Leps 
1992, Nicholson 1990, Nicholson & Seidman 1995, Raissiguier 1994, Riley 
1988). 

Core postmodern works on sexual identity follow a similar thrust, problema- 
tizing sexual categories and contesting sexual hierarchies (Butler 1993, Connell 
1987, Sedgwick 1990, Seidman 1992). In contrast to prior treatments of sexual 
identity (i.e. studies exploring the construction of a particular sexual identity 
or community—e.g. homosexual, lesbian, etc), current “queer theories” advo- 
cate an inclusive approach. They suggest, first, simultaneous considerations of 
heterosexual and homosexual identity construction, and second, serious focus 
on identities excluded by the hetero/homo duality—e.g. bisexual or transgen- 
der identities (also see De Lauretis 1984, S. Epstein 1992, Stein 1992, Warner 
1991). These basic tenets carry a provocative methodology. Queer theorists 
advocate a new “reading” of materials using an "inside/outside" opposition. 
In such an analysis, the reader must assume the connotation of homosexuality 
in the denotation of heterosexuality; the reader must reinterpret a product in 
terms of a homosexual presence. Further, the explication of a cultural product’s 
hetero/homo opposition must direct readers to all excluded forms of sexuality 
as well. Any work under analysis must be read with an eye for that which 
it itemizes and thus simultaneously implies. When queer theorists read tele- 
vision sitcom Laverne and Shirley (Doty 1993) or Alfred Hitchcock’s film 
Rope (DA Miller 1991) from a homosexual “subject-position,’ they contest 
current hierarchical structures of sexuality. Constructing alternative readings 
of a work's sexual implications deconstructs the taken-for-grantedness of the 
dominant sexual model. Reminiscent of Garfinkel's approach to identifying 
"invisible" normative structures, queer theory's subject-positioned readings jolt 
the very process of classification. (Agger 1991 and Fuchs & Ward 1994 offer 
recent, comprehensive reviews of postmodernism.) 
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Despite their differences, the issues raised by social constructionists and post- 
modernists alike direct scholarly attention tc acollective’s struggle to self-name, 
self-characterize, and claim social prerogative. Such concerns underscore the 
politics of identity. 


Identity Politics and Collective Mobilization 


Collective identity and the political movements it spurs constitute an important 
concern for identity scholars. Issues of politics are raised in previously cited 
works on gender/sexuality and race. Identity politics also provides the focus 
for many works on the formation and experiencing of social class (e.g. Blumin 
1989, Burawoy 1985, Carter 1985, Davidoff & Hall 1987, Dudley 1994, Eichar 
1989, Farrell 1993, Form 1985, 1995, Gallie 1983, Gans 1995, Garcia 1991, 
Halle 1984, 1993, Katz 1993, Katznelson & Weir 1985, Katznelson & Zolberg 
1986, McNall et al 1991, Vanneman & Cannon 1987, Wright 1989). How- 
ever, concerns with identity politics move scholars beyond this “holy trinity” of 
the discursive field. Animal protectionists, environmentalists, the health con- 
scious, the homeless, members of the 1960s student counterculture—in current 
literature, these groups too exemplify collectives moved by issues of collective 
definition, signification, and power. 

When moved by identity, collectives take on distinct properties. Spurred not 
by ideology or resource mobilization, identity-based movements act rather than 
react; they fight to expand freedom, not to achieve it; they mobilize for choice 
rather than emancipation. Alberto Melucci, central in this area, notes: 


The freedom to have which characterized. industrial society has been replaced by the 
freedom to be. The right to property has been, and remains, the basis of both industrial cap- 
italism and its competitor model, ‘real socialism’. In post-material society, there emerges 
a further type of right, the right to existence, or rather, to a more meaningful existence 
(1989:177—78; also see Calhoun 19912:51, Giddens 1991.207—17). 


In this way, identity politics creates "new social movements,” collective 
initiatives that are self-reflexive and sharply focused on the expressive actions 
of collective members (Melucci 1989:60, 1997). 

Identity politics and new social movements suggest a special form of agency— 
a self-conscious "collective agency." Identities emerge and movements ensue 
because collectives consciously coordinate action; group members consciously 
develop offenses and defenses, consciously insulate, differentiate, and mark, 
cooperate and compete, persuade and coerce. In such a context, agency en- 
compasses more than the control and transformation of one's social environ- 
ment. Rather, borrowing from Charles Taylor's discussion of agency and the 
self (1985:287), I suggest that collective agency includes a conscious sense 
of group as agent. Further, collective agency is enacted in a moral space. A 
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collective pursues the freedom to be because that which frames the collective’s 
identity defines their existence as right and good (Taylor 1989). 

In connection with issues of identity politics and collective agency, David 
Snow, Robert Benford, William Gamson, Doug McAdam, and others address 
the framing and schematization of identity as it occurs within social move- 
ments. Studies in this area delineate the frame alignment processes that bring 
both group focus and shared identity to particular collectives at specific his- 
torical moments. Further, these works itemize the ways in which resulting 
collective identities then direct movement participants by defining the param- 
eters and appropriate arenas of collective action. Such analyses remain fully 
mindful of the ways in which movement participants’ perceptions of history, 
social structures, and cultural arrangements constrain or enhance the interpre- 
tive processes. This work’s appeal rests in its multifaceted theoretical base. 
By merging sociocognitive construction processes with concerns for structural 
and organization factors, this literature creates an exciting conceptual bridge, 
linking micro and macro analyses, linking cultural and social concerns. Such 
works provide a model that forces the simultaneous consideration of thought, 
articulation, and action (see e.g. Benford 1993, Benford & Hunt 1992, Cohen 
1985, Fantasia 1988, Gamson 1992, Gamson et al 1982, Gerhards & Rucht 
1992, Hunt et al 1994, Jasper & Nelkin 1992, Johnston 1991, Johnston et al 
1994, Larana et al 1994, Lichterman 1996, McAdam 1982, 1988, 1994, Morris 
& Muellar 1992, Snow et al 1986, Snow & Benford 1992, Tarrow 1992, Taylor 
& Whittier 1992). 

Recent literature raises concerns regarding the long-term social consequences 
of identity politics. Michael J Piore (1995), for example, writes of identity- 
based movements as isolated, cohesive “communities of meaning.” Because 
such groups are narrowly focused and formed relative to distinctions, Piore 
argues that they find themselves incapable of cross-boundary exchange. Fur- 
ther, he believes such groups often remain unaware of the economic conditions 
that may constrain their collective goals. Based on these observations, Piore 
locates identity movements in America’s ideological roots of individualism. 
He suggests that current socioeconomic conditions beckon a change in this 
stance. Using cognitive theories derived from sociology and anthropology, 
Piore presents a five-step plan aimed at replacing identity politics with a shared 
commitment to a unified national structure. 


IDENTIFICATION PROCESSES 


Attention to collectives and the establishment of their identities has re-energized 
scholarly interests in the identification process itself. A growing literature ex- 
plores the mechanics by which collectives create distinctions, establish 
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hierarchies, and renegotiate rules of inclusion. Such works are closely linked 
to important knowledge theories, including Bourdieu's theories of distinction, 
Derrida's focus on difference, Foucault's genealogy of epistomes, the semiotic 
models of Saussure and Pierce, and Zerubavel's work on sociomental classifi- 
cation. This section elaborates on a variety of identification processes currently 
under study. 

Michele Lamont (1992) documents the role of symbolic boundaries in the 
construction of valued identities. Using rich data drawn from interviews with 
upper-middleclass men in France and the United States, Lamont specifies the 
conditions under which moral, socioeconomic, and cultural boundaries suc- 
cessfully create objective conditions of socioeconomic inequality. In contrast 
to Bourdieu, Lamont maintains a tri-part focus, demonstrating that the impor- 
tance of boundary types varies across space and time. Her work also empha- 
sizes boundary strength; her findings indicate that only those boundaries firmly 
grounded in widely shared meaning prove sufficiently strong to generate hierar- 
chy and confer relative value to collective identities (Lamont 1995, 1997; also 
see Cohen 1986, Lamont & Fournier 1992, Sahlins 1989). In related work, Jill 
Quadagno & Catherine Forbes (1995) use cultural and social lenses to explore 
identification and distinction among US Job Corps participants. The authors 
examine the workings of both symbolic boundaries and structural barriers as 
these factors contribute to gender reproduction and gender inequality. Concerns 
with symbolic boundaries characterize recent inquiries in life course research 
as well. Works by Gaines (1991), Jeffreys (1989), Modell (1989), Postman 
(1982), Waksler (1991), Winn (1983), and Zelizer (1985) help to refocus soci- 
ologists on the cultural contexts within which age categories are constructed, 
age identities are built, and age transitions occur. 

Margaret Somers (1994) and Harrison White (1992) approach issues of iden- 
tification by specifying the cultural repertoires or systems of meaning that char- 
acterize various symbolic communities. These authors are especially concerned 
with the ways in which social context and social location enable the invoking 
of such repertoires. In this regard, Paul DiMaggio's (1982, 1987, 1992) land- 
mark research on the arts also proves 1mportant. DiMaggio demonstrates the 
ways in which art acquisition and classification solidify status categories and 
distinguish the elite from the ordinary. 

In a different arena, Jane Bachnik & Charles Quinn (1994) focus on index- 
icality and its role in the construction of Japanese identity. Building on the 
Japanese concepts “uchi” (inside) and "soto" (outside), various authors explore 
the ways in which these boundary distinctions direct collective orientation and 
pattern behavior. In probing identity, all authors consider both linguistic com- 
munication and social practice through their linkage to context. Elsewhere, 
Eviatar Zerubavel (19972) offers a special edition that brings his concepts of 
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lumping and splitting to life. Authors chart these complementary sociomental 
processes as they explore identification and distinction in monetary exchange, 
fetal classification, the construction of sexual identities, and other interesting 
areas (also see E Zerubavel 1991, 1997b). Among social psychologists, John 
C Turner presents depersonalization as a process enabling collective identities. 
His work maps the ways in which depersonalization permits social stereotyping, 
group cohesiveness, ethnocentrism, cooperation and altruism, emotional con- 
tagion and empathy, collective action, and other processes (Turner et al 1987, 
1994). Finally, James Aho (1994) invokes Berger & Luckman’s five steps of 
reification to elaborate both the construction and deconstruction of political 
enemies. Aho situates his analysis in a variety of recent incidents, including 
the Ruby Ridge affair, the dissolution of the Soviet Union, and a case study of a 
KKK defection. Using media reports, government documents, and interviews, 
Aho carefully explores the development of collective moral righteousness. He 
notes the processes by which such sentiments legitimate the destruction of the 
“other.” Grappling with the question “Can one struggle effectively against evil 
without become tainted by it?’, Aho builds an interesting case for an enemy 
that is both “them” and “us.” 

The study of objects also proves key to recent research on identification. 
Several works note the ways in which individuals and groups use art ob- 
jects (Martorella 1989), commodities and commodity signs (Appadurai 1986, 
Goldman 1992, Hennion & Meadel 1993, O' Barr 1994), orclothing (Rubenstein 
1995) to articulate and project identities. Complimenting this agenda, Dauber's 
(1992) work on Pueblo pottery, Mukerji's (1994, 1997) studies of formal French 
gardens, and Zukin's (1991) exploration of city structures use objects to better 
understand the political, cultural, social, and economic contexts 1n which the 
objects are produced. Finally, Nippert-Eng (1996) and Silver (1996) offer es- 
pecially interesting excursions on the role of personal possessions in bridging 
identity transitions. 

Several works, anchored in the study of discourse and symbolization, pro- 
vide a multitiered analysis of collective identification and the ideologies that 
support it. In contrast to deterministic theories, such studies approach identi- 
fication as a process that unfolds in relation to economic, historical, and polit- 
ical contexts. Ás such, multitiered investigations view identification at critical 
junctures in a collective's history, including periods of identity production, 
its institutionalization, and periods of identity interpretation. Further, these 
works cast identity discourse and symbols as mediators of structure and action. 
Robert Wuthnow's (1989) work on communities of discourse exemplifies the 
approach. Wuthnow explores the general cultural, political, and economic con- 
ditions that enabled three specific ideologies: the Protestant Reformation, the 
Enlightenment, and Marxist socialism. After charting the historical sources of 
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these agendas, Wuthnow traces the modes by which each ideology was institu- 
tionalized, taking readers through the schools, religious and scientific groups, 
and the governments and media that modeled and disseminated each perspec- 
tive. Finally, Wuthnow examines audience reception and reaction, analyzing 
the collective application and experience of the three ideologies through the 
decades. Wuthnow's rich inquiry demonstrates the complex chain by which 
movement ideologies and resulting identities are both born and sustained. 

Following Wuthnow's example, Valentine Moghadam (1994) adopts the 
multitiered approach, analyzing fundamentalist discourse and its relationship 
to gender and national identities. Other multitiered works include Michele 
Dillon's (1993, 1996) studies of discourse on divorce and abortion. Her works 
explore the ways in which political climate and the institutional status of dis- 
course producers relates to both the nature and the effect of discursive strategies. 
In a similar vein, David Campbell (1992) examines the American "discourse 
of danger," exploring the ways in which such a discourse shapes visions of the 
American “us,” the enemy "them," and ultimately, the form of American foreign 
policy. Cerulo (1995), too, applies a multitiered perspective in her study of na- 
tional identity symbolization. Probing the various contexts of anthem and flag 
adoption, she identifies a set of social structural variables that appear to delimit 
general rules of symbolic expression. Cerulo also explores the institutional- 
ization of identity symbols, suggesting a new theoretical model for predicting 
symbol change. Finally, her inquiries on symbol reception and interpretation 
elucidate faulty identity symbolization, thus specifying the conditions under 
which symbols can fail to capture the fervency of those they portray. 


TECHNOLOGY AND IDENTITY 


In the present environment, one cannot consider identity without reference to 
new communication technologies (NCTs). NCTs have changed the backdrop 
against which identity is constructed; they have reframed the generalized others 
and the "generalized elsewheres" (Meyrowitz 1989) from which the self takes 
its cues. 

Joshua Meyrowitz (1985, 1989, 1997) was among the first to fully explore the 
NCT/dentity link. His works examine the ways in which electronic media re- 
organize the sites of social interaction. According to Meyrowitz, NCTs weaken 
or sever the connections between physical and social “place.” In this way, NCTs 
locate the self in new hybrid arenas of action; they mesh public and private, 
beckon new types of performances, and form new collective configurations: 


Television has fostered the rise of hundreds of "minorities" —people who in perceiving a 
wider world, begin to see themselves as unfairly isolated m some pocket of it. Television has 
empowered the disabled and the disenfranchised by giving them access to social information 
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in spite of their physical isolation Television has given women an outside view of their 
incarceration in the home Television has weakened visible authorities by destroying the 
distance and mystery that once enhanced their aura and prestige And television has been 
able to do this without requiring the disabled to leave their wheelchairs, without asking the 
housewife to stop cooking dinner, and without demanding that the average citizen leave his 
or her easy chair (1986:309) 


According to Meyrowitz, the places enabled by NCTs reconfigure the bound- 
aries that distinguish collectives. Thus at the present social moment, the differ- 
ences dividing children from adults may be less stark than those that distinguish 
the computer literate from the nonliterate. The line that separates home from 
work may now pale in comparison to online/offline borders. 

David Altheide (1995) poses complementary positions in his work on NCTs 
and the self. He argues that NCTs enable new communication formats—new 
modes of selecting, organizing, and presenting information. In turn, these new 
formats reshape social activity; they modify or dismantle current practices, and 
spur or shape new ones. In this way, NCTs create new environments for self- 
development and identification; they present new opportunities for collective 
affiliation and mobilization. Altheide’s writings on keyboard technology illus- 
trate these ideas. In considering human experience with telephones, ATMs, 
computers, video games, calculators, and television controls, the author casts 
the keyboard as a new door to interaction. He argues that keyboard technol- 
ogy initiates a large majority of modern exchange. But while keyboards may 
transport us to places not easily accessed in the past, Altheide argues that the 
technology limits and directs the form and substance of the social interaction 
it enables. For example, keyboard technology reduces the distance between 
children and adults, often promoting a type of reverse socialization. Similarly, 
keyboard technology homogenizes work and play sites; keyboards merge adult 
work and play worlds, Jink adult workspaces and children’s play spaces, and 
reconfigure children’s play via the world of adult tools (1995: chaps. 2, 3). 
Beyond specific communication tools, Altheide also explores the intersection 
of communication, power, and social control. He considers the ways in which 
technology and its resulting communication formats provide some with the 
power to define a social situation while leaving others vulnerable to the real- 
ity of crafted images. In these and similar discussions, Altheide outlines the 
ways in which communication formats can “block” the social stage, scripting 
emergent action even in realms thought to possess an internal logic. 

In the spirit of Meyrowitz and Altheide, many explore the impact of NCTs on 
community formation and resulting collective identity. Beniger (1987) initiated 
this agenda, suggesting specific ways in which media-generated communities 
provide a “pseudo-gemeinschaft’ experience. Subsequent works describe tech- 
nologically generated communities as more tangible and real. Several authors 
document the processes by which NCT's build *we-ness;" demonstrating the 
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concrete effects of techno-links and charting emergent cultures of reference that 
can unify once disparate social actors (Cerulo et al 1992, Cerulo & Ruane 1997, 
Dayan 1992, Liebes 1990, Purcell 1997, Steuer 1992, Tichi 1991). But others 
are less enthusiastic regarding the impact of NCTs on community and identity. 
Schlesinger (1993), for example, notes both the potential and the surprising fail- 
ure of electronic media in constructing a unified European identity among the 
nations that comprise the Euro-community (also see Morley & Robins 1995), 
Similarly, Fisher's (1992) sociohistorical research on the telephone suggests 
that community structure can be remarkably resilient to technological change. 
Overall, Fisher (1997) remains skeptical of NCT's ability to reconfigure so- 
cial bonds (also see Postman 1992). These contradictory positions have led 
some to initiate multidimensional models, models designed to address and dis- 
tinguish varying forms of NCT-generated bonds (see Calhoun 1991b, Cerulo 
1997). 

Related works magnify identification processes within specific NCT do- 
mains. Sherry Turkle (1995), for example, explores online communities and 
their impact on personal identity construction. She follows members of a vir- 
tual community as they interact in "multi-user domains" (MUDS). Testimony 
of MUD members, along with Turkle's keen insights, provide a unique picture 
detailing the building and experiencing of online persona. Further, Turkle docu- 
ments the ways in which individuals negotiate online identities relative to other 
facets of the self. By probing the balance between “virtual” selves and "real" 
selves, Turkle's work forces us to question any perspective that places virtual 
experience second to the concrete. In another arena, Byron Reeves & Clifford 
Nass (1996) approach communication media as objects relevant to identity- 
building interactions. In essence, the authors find that media objects become a 
viable “other” in the building of self, and they outline the ways in which human- 
to-machine relationships mirror purely human relationships. Reeves & Nass 
discover, for instance, that people treat computers with female voices differ- 
ently than those with male voices; people are polite to computers even though 
they don't need to be; the size and movement of TV screen images affects 
physical responses and perceptions of personal bodyspace in ways identical 
to real-life motion. These patterns hold implications for identity studies and 
beyond. Indeed, relevant to communication efficacy, the authors suggest that 
the human brain has not sufficiently evolved relative to technology's rapid ad- 
vancement. Reeves & Nass suggest ways in which this knowledge can improve 
future technological products. 


CONCLUSION 


The literature here reviewed constitutes an antithesis to traditional identity stud- 
ies, an antithesis built upon several research fronts. The works cited here refocus 
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scholarly attentions from the individual to the collective. These works often 
prioritize discourse over the systematic scrutiny of behavior. Many studies ap- 
proach identity as a source of mobilization rather than a product of it. Finally, 
the analysis of virtual identities now competes with research on identities es- 
tablished in the copresent world. In considering the old and the new of identity, 
one finds a field ripe for synthesis. This section suggests potential avenues for 
synthesis, noting works that exemplify such efforts. The works chosen here are 
not “ideal types,” but rather suggestive models—models that illustrate what can 
be gained when the future is mined from careful reflection on past and present. 

Some have achieved a productive synthesis of identity work through the 
reconciliation of theory. In recent work on gender identity, for example, 
Judith Lorber (1994) synthesizes constructionist and postmodern concerns of 
the day with issues raised by sex role theorists and Marxist feminists of past 
decades. Her efforts result in an interesting theory that frames gender as a 
social institution—a free-standing entity that establishes patterns of expecta- 
tions, orders social processes, and drives social organizations. In another arena, 
Norbert Wiley (1995) merges Peircian and Meadian paradigms to form a neo- 
pragmatist view of the self—the self as a three-dimensional dialogue between 
“I” “you,” and “me.” Wiley then works with pragmatist notions of reflexivity 
and Durkheimian solidarity concepts to create a model of “a semiotic self,” a 
sui generis self resistant to social determination. In a historical era spurred by 
identity politics, Wiley argues that conceptions of a sui generis self may prove 
vital to the defense of democratic principles. 

Others attend to macro-micro linkages in promoting the cause of synthesis.? 
Indeed, several recent theoretical advances relevant to identity studies rest 
on successful macro-micro linkages: Bourdieu's work on habitus, Giddens's 
structuration theory, or Habermas's theories of communicative action. Fur- 
ther, a number of the works heretofore discussed have successfully estab- 
lished the links of which I speak. Research on social movements or sev- 
eral of the multitiered identity projects represent prime illustrations. Other 
macro-micro initiatives are underway. For example, determined to eradicate 
the micro-macro divide, Deidre Boden (1994) innovatively combines elements 
of each arena, thus building a unique analytic approach to organizational iden- 
tity, form, and function. Boden maps conversational exchanges across varied 
organizational settings, using these data to configure the structure of talk in 
organizations. She then examines talk structures as vehicles that constitute 
organizations, analyzing organizations as they emerge within daily interaction. 
Boden argues that "the ways in which organizational actors realize both the 


*Within sociology at large many advocate this path (Collins 1986, Huber 1991, Ritzer 1990), 
in particular, with reference to issues addressing the intersection of the social and the cultural 
(Schudson 1989, Sewell 1992, Swidler 1986) 
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constraints and opportunities in their working environments is critical to what 
actually constitutes an ‘environment’ " (p. 32). For Boden, the micro-processes 
of talk become the macro-structure of the organization. (Snow 1987 offers a 
similarly interesting project on talk and identity.) In another arena, Cerulo 
(1997b) pursues macro-micro linkages in analyzing narratives of violence. She 
explores the institutionalization of storytelling formats in this area and doc- 
uments the impact such formats can have on audience evaluations of violent 
acts. In essence, her study elucidates the conditions under which macro-social 
norms of communication can direct complex micro-patterns of cognition and 
identification. Among social psychologists, Peggy Thoits & Lauren Virshup 
(1997) propose a macro-micro merger that hinges on social theories of the 
mind. Specifically, they suggest ways in which self-schema theory might unify 
inquiries on individual and collective identity. 

In the cause of synthesis, scholars of collective identity also might revisit 
traditional micro-level studies in the area. Much common ground exists bet- 
ween traditional and new approaches to the topic. For example, I noted earlier 
that collective identity scholars are currently exploring the ways in which mul- 
tiple identity affiliations qualitatively change the nature of human experience. 
During the 1980s, social psychologists’ addressed similar themes, focusing on 
multiple roles, their resulting identities, and the impact of both on human ex- 
perience (see e.g. Burke & Franzoi 1988, Stryker 1980, Thoits 1986). To be 
sure, collective and micro-level inquiries pursue different elements of “human 
experience.” Yet, knowledge of the cognitive processes, social practices, and 
symbolic tools with which identity 1s constructed, enacted, and projected 1s 
integral to each school of thought. Knowing this, focused efforts to translate 
certain findings from the social psychological realm to the macro level could 
hold rich rewards for collective identity studies. 

At the broadest level, an important site of synthesis rests in the careful blend- 
ing of intellectual perspectives. Within the past two decades, the humanist 
or cultural studies approach to identity has dominated the field. To be sure, 
sociologists of identity cannot afford to ignore these works, for they provide a 
rich and thorough treatment of the symbols, rituals, and world views that con- 
stitute identity. At the same time, the sociologist must consider this literature 
with some care. At present, the cultural studies position appears somewhat 
trapped in a singular conclusion that locates the constructed nature of culture 
in the sole service of power. Further, such works frequently frame social action 
as a process that is fully culturally constituted. In tapping identity materi- 
als garnered from the cultural studies approach, sociologists must diligently 
maintain the critical analytic distinction between the social and the cultural 
(see Schudson 1997). Via careful consideration of actors, collectives, and 
broader social institutions, via thoughtful attention to lived experience and the 
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cultural products and rituals associated with such experience, the sociology of 
identity can fully elucidate the intricate links between the social and cultural 


domains. 
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ABSTRACT 
Discussions of New Social Movements have sought to explain the apparent shift 
in the forms of contemporary social movements in Western nations by linking 
it to the rise of a postmodern world. However, the central propositions of the 
NSM paradigm have not been critically analyzed in terms of its concepts or the 
evidence. This review provides a critical analysis of the NSM thesis, finding that 
the central propositions are not defensible as a theory or a paradigm. 


INTRODUCTION 


The “New” Social Movement (NSM) paradigm is a recent addition to social 
theory that stresses both the macrohistorical and microhistorical elements of 
social movements. On the macro level, the NSM paradigm concentrates on 
the relationship between the rise of contemporary social movements and the 
larger economic structure, and on the role of culture in such movements. On 
the micro level, the paradigm is concerned with how issues of identity and 
personal behavior are bound up in social movements. The NSM paradigm 
offers a historically specific vision of social movements as associated with 
new forms of middle-class radicalism. It presents a distinctive view of social 
movements and of the larger sociopolitical environment, of how individuals fit 
into, respond to, and change the system. However, whether this nascent view 
qualifies as a cogent and empirically grounded paradigm has not been seriously 
examined. This review provides a critique of the central propositions of the 
NSM paradigm, assessing the evidence supporting its claims. 
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The NSM paradigm argues for a temporal, structurally linked understanding 
of social movements. Social movements are seen as being shaped and largely 
determined by social structure. In the industrial era, following a Marxist logic, 
social movements were believed to be centered in the working class. Working 
class movements were seen as instrumentally based actions concerned with 
matters of economic redistribution. Regardless of whether social movements 
_of the industrial era can be characterized in such categorical terms, it was the 
standard by which contemporary movements were compared. Contemporary 
movements (post-1965) were, however, not well explained by social theories 
that saw the working class as the site of revolutionary protest (Eyerman 1984, 
Olofsson 1988). In Europe, the defining events were the wide-scale student 
protests that took place in France and Berlin in 1968 and in Italy in 1969. In 
the United States, the rise of the student antiwar movement of the mid-1960s 
was seen as marking a similar radical departure from the past. In Europe, 
where Marxist theories of social movements dominated, Marxist theorists were 
unable to provide a convincing explanation for why students had become the 
vanguard of protest and why movement demands centered around quality of life 
rather than redistributive issues (Touraine 1971). With the predicted Marxist 
revolution not in sight, the shift of protest away from the working class, and the 
changing shape and form of protest in contemporary times, Marxist theorists 
saw the need to reformulate their ideas. While not all Marxist went in the same 
direction (see Boggs 1986), some of them postulated the NSM paradigm as 
an alternative (see Cohen 1985, Melucci 1980, 1985, Touraine 1977, 1981). 
In fact, much of the NSM discourse can be said to be a direct reaction to 
the perceived deficiencies of Marxism (Epstein 1990, Laclau & Mouffe 1985, 
Plotke 1990). 


NSM PARADIGM 


Although there are differing perspectives on NSMs (see Buechler 1995 for an 
overview), a set of core concepts and beliefs can be said to comprise the NSM 
paradigm. The central claims of the NSM paradigm are, first, that NSMs are 
a product of the shift to a postindustrial economy and, second, that NSMs are 
unique and, as such, different from social movements of the industrial age. 
NSMs are said to be a product of the postmaterial age (some refer to it as 
mature capitalism or postindustrialism) and are seen as fundamentally different 
from the working class movements of the industrial period (Olofsson 1988). 
NSM demands are believed to have moved away from the instrumental issues 
of industrialism to the quality of life issues of postmaterialism (Buechier 1995, 
Burklin 1984, Inglehart 1990, Parkin 1968). NSMs are, in short, qualitatively 
different (Melucci 1981). 
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However, when broadly related to contemporary movements, these observa- 
tions are essentially flawed. Just as the Marxist theories tended to marginalize 
protest that did not stem from the working class, so too have NSM theorists 
marginalized social movements that do not originate from the left. Contem- 
porary right-wing movements are not the subject of their focus. Thus, the 
NSM paradigm describes (at best) only a portion of the social movement uni- 
verse. But there is no a priori reason for eliminating conservative and counter 
movements from consideration. One might excuse the omission of counter- 
movements because they are believed to be reactions to insurgent movements, 
largely determined by the goals, ideology, tactics, and participants (in a negative 
way) of these (see Mottl 1980, Pichardo 1995, Zald & Useem 1987). However, 
this is not the case for movements, many of which over the last 20 years seem 
to be unique reactions (of a conservative character) to the alienating effects of 
postindustrial society. Some examples include the Christian Right and militia 
movements. However, the NSM paradigm is based solely on observations of 
left-wing movements and reflects this ideological bias undergirding the NSM 
paradigm. Yet, movements of both the left and the right are linked to changes 
in social structure. Understanding how other social groups perceive and react 
to these changes can only broaden our knowledge of social change. 

The NSM paradigm can only claim to explain left-wing movements of the 
modern age.! Among the movements typically studied by NSM researchers are 
the “urban social struggles, the environmental or ecology movements, women’s 
and gay liberation, the peace movement, and cultural revolt linked primarily to 
student and youth activism” (Boggs 1986:39-40). Is there any reason to believe 
that the populations supporting these movements, which are said to be a product 
of a fundamental change in the economic structure, should be affected while 
those populations supporting the militia, right-to-life, wise use, and Christian 
right movements should not be similarly affected? Although a possible rationale 
for this distinction could be constructed, the NSM perspective offers none, and 
its failure to do so marks a serious flaw ın its reasoning.’ It is not the purpose 
of this review to extend the NSM thesis to right-wing movements, although I 
shall offer some tentative suggestions in the conclusion. 

Putting this aside for now, let's review the NSM paradigm in terms of what 
it claims. As is typical of new paradigms in the process of establishing them- 
selves, their propositions are strongly stated (or perhaps overstated) to firmly 


lIn all fairness, most social movement theories are based on observations of left-wing move- 
ments, this 1s a broad failing of social movement research in general. Nonetheless, with the current 
growth of right-wing conservative movements in the modern era, it is perhaps especially pertinent 
to NSM theory to account for them as well 

?This refers specifically to why other social groups (especially conservative groups) do not 
engage in activism. 
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distinguish them from other theories. Subsequently, these strong propositions 
are reevaluated, and weaker propositions may be substituted. Strong statements 
may be hard to defend. This is especially the case for the NSM paradigm. Many 
of its original strong propositions have been found wanting. Therefore, I focus 
here primarily on the weaker versions. 


Are Contemporary Movements Unique? 


The first claim of the NSM paradigm is that contemporary movements are fun- 
damentally different in character than movements of the past. These differences 
are said to appear in the ideology and goals, tactics, structure, and participants 
of contemporary movements. 


IDEOLOGY AND GOALS The central factor characteristic of NSMs is their dis- 
tinct ideological outlook (Dalton et al 1990). It is from this difference that all 
others flow. The NSM paradigm states that contemporary social movements 
represent a fundamental break from industrial era movements. Rather than 
focusing on economic redistribution (as do working-class movements), NSMs 
emphasize quality of life and life-style concerns. Thus, NSMs question the 
wealth-oriented materialistic goals of industrial societies. They also call into 
question the structures of representative democracies that limit citizen input 
and participation in governance, instead advocating direct democracy, self- 
help groups, and cooperative styles of social organization. “The theme of the 
self-defense and democratization, raised implicitly (and sometimes explicitly) 
by the movements, ... [is] the most significant element in the contemporary 
struggle for democratization” (Cohen 1983:102). Taken together, the values of 
NSMs center on autonomy and identity (Offe 1985). 

In many ways, identity claims are the most distinctive feature of NSMs 
(Kauffman 1990), although all previous movements can also be described as ex- 
pressing identity claims (see e.g. Aronowitz 1992). The focus on identity is 
considered unique in modern movements because “identity politics also express 
the belief that identity itself—its elaboration, expression, or affirmation—is and 
should be a fundamental focus of political work. In this way, the politics of 
identity have led to an unprecedented politicization of previously nonpolitical 
terrains ..." (Kauffman 1990:67). This is expressed in the notion that “the 
personal ıs political" However, whether the politics of identity represents a 
liberation or stagnation of modern politics is a point of contention. The libera- 
tion of joining the personal with the political may represent a radical challenge 
to the hegemony of state domination, but it may also result in an “anti-politics 
of identity"—an apolitical withdrawal from politics (Kauffman 1990). 

However, little empirical work has examined the impact of identity claims 
on social movement participation, with two notable exceptions: Klandermans 
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(1994) has examined, in the Dutch peace movement, how the varying collective 
identities, as represented by the different organizational memberships, predicted 
the defection of participants from the movement. NA Pichardo, H Sullivan- 
Catlin, & G Deane (unpublished manuscript) have examined the role of personal 
identity in relation to participation in the environmental movement. Their find- 
ings show a significant, though not strong, association between self-reported 
environmental identity and participation both in conventional social movement 
activities (event participation, organizational membership, movement contri- 
butions) and in everyday behaviors (conserving energy and water, using alter- 
native transportation, and purchasing products made from recycled materials). 
Clearly, more empirical work on the connection between identity, at all its 
levels, and movement participation needs to be done. 

The other, supposedly unique, ideological feature of NSMs is its self-reflexive 
character. This means that participants are constantly questioning the meaning 
of what is being done (Cohen 1985, Gusfield 1994, Melucci 1994). This has 
led to conscious choices of structure and action—choices said to typify NSMs. 
The best example of this is the consciousness-raising groups characteristic 
of the feminist movement (Van der Gaag 1985; see also the volume edited 
by Katzenstein & Muller 1987). The decision to organize in a decentralized 
fashion, to operate under democratic principles with rotating leadership, is 
seen by some as consistent with Michel’s Iron Law of Oligarchy, whereby 
the goals of organizations become displaced (Kitschelt 1990). The unique 
ideological orientation and self-reflexive character largely dictate the kinds of 
tactics, structures, and participants evidenced in NSMs. 


TACTICS The tactics of NSMs mirror their ideological orientation. The belief 
in the unrepresentative character of modern democracies is consistent with its 
anti-institutional tactical orientation. NSMs prefer to remain outside of normal 
political channels, employing disruptive tactics and mobilizing public opinion 
to gain political leverage. They also tend to use highly dramatic and preplanned 
forms of demonstrations replete with costumes and symbolic representations 
(Tarrow 1994). 

This, however, does not mean that NSMs do not involve themselves in poli- 
tics, nor avoid becoming institutionalized themselves. As noted by Eder (1985), 
“NSMs manifest a form of middle-class protest which oscillates from moral 
crusade to political pressure group to social movement” (p. 881). Some NSMs 
have become integrated into the party system and gained regular access to regu- 
latory, implementation, and decision-making bodies, while others have formed 
political parties that regularly contest for electoral representation (Berry 1993, 
Hager 1993, Kitschelt 1990, Gelb & Paley 1982, Rochon 1990, Rochon & 
Mazmanian 1993, Tarrow 1990). A number of Green parties are prominent in 


416 PICHARDO 


Europe, with several having local manifestations in the United States (Bahro 
1986, Burklin 1982, 1985, Capra & Spretnak 1984, Hershey 1993, Kolinsky 
1989, Muller-Rommel 1985, 1990, Poguntke 1993). However, no direct cor- 
respondence appears between supporters of NSMs and those who vote for 
Green parties (Chandler & Siaroff 1986, Muller-Rommel 1985). So, the NSM 
paradigm recognizes that there is no truly distinctive tactical style of NSMs; 
rather, public opinion and anti-institutional politics have been recent and more 
prominent additions to the repertoire of social movements. 


STRUCTURE The anti-institutional posture of NSMs also extends to the way 
they organize. NSMs attempt to replicate in their own structures the type of 
representative government they desire. That is, they organize themselves in a 
fluid nonrigid style that avoids the dangers of oligarchization. They tend to 
rotate leadership, vote communally on all issues, and to have impermanent ad 
hoc organizations (Offe 1985). They also espouse an anti-bureaucratic posture, 
arguing against what they perceive as the dehumanizing character of modern 
bureaucracy. 


[NSMs] oppose the bureaucratization of society ın economics and politics that allegedly 
suffocates the ability of individual citizens to participate in the definition of collective 
goods and identities. Instead they call for a culturally libertarian transformation of social 
institutions that gives more leeway to individual choice and collective self-organization 
outside the economic commodity cycle or bureaucratic political organization (Kitschelt 
1993:15). 


Thus, they call for and create structures that are more responsive to the needs 
of the individuals—open, decentralized, nonhierarchical (Zimmerman 1987). 
Motivated by the lessons of the past, they hope to avoid becoming coopted or 
deradicalized. This is to claim, not that all NSMs are so organized, but that 
this form of organizing is more prevalent than in past times. The ideal-typical 
organizational style of NSMs should not be seen as reflecting the organizational 
styles of every NSM. Groups such as the National Organization of Women, and 
various environmental groups employ more traditional centralized, hierarchical 
forms of organization (Paley & Leif 1982, Shaiko 1993). 


PARTICIPANTS ‘There are two views on who the participants of NSMs are and 
why they join. The first places the base of support within the “new” middle 
class: a recently emerged social stratum employed in the nonproductive sectors 
of the economy (Cotgrove & Duff 1981, Lowe & Goyder 1983, Rudig 1988). 
Research on the rise of the new middle class within postindustrial society seems 
to establish the credibility of this social phenomenon (Brint 1994). But NSM 
theorists go a step beyond, by arguing that this stratum produces the chief 
participants of NSMs because they are not bound to the corporate profit motive 
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nor dependent on the corporate world for their sustenance. Instead, they tend 
to work in areas that are highly dependent upon state expenditures such as 
academia, the arts, and human service agencies, and they tend to be highly 
educated (Offe 1985). Another explanation posits that NSMs are the result of 
conflict over the control of work. 


In this conflict, the professionals whose control 1s based on expertise and skills are defending 
themselves against the encroachments on their work autonomy by colleagues who are 
primarily involved 1n the administration of the large private and public employers for 
whom the former work. Tending to lose out in this conflict, the skills and service-oriented 
professionals constitute a cruc:al structural potential for the new social movements, all 
of which attack in one way or another the unrestricted reign of technocracy (Kriesi et al 
1995:xix). 


However, there is recognition that within this overall descriptive categoriza- 
tion differences exist. 

The other view of the participants of NSMs is that they are not defined by class 
boundaries but are marked bv a common concern over social issues. It is an ide- 
ological, rather than ethnic, religious, or class-based community. In this light 
Arato & Cohen (1984) refer to the West German Greens as a “catch-all” party. 
They are defined by common values rather than a common structural location. 
Offe (1985) offers a slightly different view of who the NSM participants are. He 
argues that they are drawn from three sectors: the new middle class, elements 
of the old middle class (farmers, shop owners, and artisan-producers), and a 
“peripheral” population consisting of persons not heavily engaged in the labor 
market (students, housewives, and retired persons). A number of studies of 
the peace movement in various countries have demonstrated an equally diverse 
set of participants (Parkin 1968; see also the volume edited by Kaltefleiter 
& Pfaltzgraff 1985). Diani & Lodi (1988) show that within the Milan ecol- 
ogy movement, several different currents attract somewhat different sets of 
participants. 

However, neither view, in its narrowly defined sense, is supported by the ev- 
idence. Studies of environmental movements reveal that NSM participants are 
drawn primarily from two populations: The "new" middle class is one; the other 
is geographically bound communities that are being directly affected by the neg- 
ative externalities of industrial growth. Participants are the more ideologically 
committed middle class as well as communities that protest the siting of haz- 
ardous waste sites, landfills, and waste incinerators, or chemical and/or radiation 
poisoning of the local environment (see Apter & Sawa 1984, 1970, Opp 1988, 
Szasz 1994, Walsh 1981). The old middle class typically is also involved in re- 
gional issues (Touraine 19813. In short, the participants of environmental move- 
ments do not draw significantly from outside the white middle class unless there 
is some motivating, geographically based, grievance. For example, minority 
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communities have rarely participated in the environmental movement, except 
in protest over the placement of unwanted waste facilities (Bullard 1990).? The 
lack of minority participation is equally true of most other NSMs, including 
the animal rights, feminist, peace, and gay and lesbian movements. 

Whether middle-class participants engaged significantly in protest in the past 
is a critical question for the NSM paradigm. Such activity has been severely 
underestimated. Waves of middle-class protest have occurred since the early 
1800s in both Europe and the United States. These include the abolition, prohi- 
bition, suffrage, and progressive (as well as a number of nativist) movements. 
Thus, whether the middle class is only newly involved in social movements is 
indeed open to question. 

Are these proposed characteristics unique? If such differences exist, then it 
would mark a significant break from the past. The problem is that too many 
exceptions are cited. NSMs espouse open, democratic, nonhierarchical struc- 
tures, yet there are many NSM organizations that are not so characterized. They 
disdain institutional politics, yet many NSMs are regularly consulted by gov- 
ernmental bodies, and others have formed political parties. NSMs tend to draw 
from the new middle class, yet many community-based mobilizations (primar- 
ily environmental) have developed. Furthermore, the middle class is not a new 
site of social protest. NSMs tend to employ nontraditional tactics but also use 
those commonly employed by social movements of the past (lobbying, getting 
out the vote, court cases, etc). 

One cannot evade the fact that the striking feature of the contemporary situation of the 


movements is its heterogeneity. The old patterns of collective action certainly continue to 
exist In some movements they may even be statistically preponderant (Cohen 1985:665). 


Not only are the individual characteristics of movements in question, so too 
are those of the new social movements in general. D’ Anieri et al (1990) argue 
that there is no difference between utopian movements of the 1800s and present- 
day movements. In a similar vein, Calhoun (1991) sees no stark differences 
between movements of the nineteenth and twentieth centuries, arguing that 
social movements of the nineteenth century were not dominated by economistic 
organizations. 

The notion of movement “newness” has also been criticized in terms of 
whether contemporary movements represent a distinct break from movements 
of the past. For Touraine (1982), this radical departure is a function of their 
level of reflexivity and the changed locuses and stakes of the struggles. For 
Evers (1985), what is new about NSMs is that the “transformatory potential 
within new social movements is not political, but socio-cultural” (p. 49). That 
is, they aim to reappropriate society from the state. However, a host of other 


3This kind of placement has been referred to as environmental racism, a term grounded in the 
belief that decisions about locating hazardous waste sites ignore the interests of minority groups. 
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authors have stressed the continuity of NSMs with previous movements (Cohen 
1985, Crighton & Mason 1986, Eder 1985). Perhaps Melucci (1994) put the 
nature and importance of this controversy in the best light when he said it was 
futile. It ts not a question of deciding whether the empirical data observed are 
equivalent and comparable; instead, the question is whether their meaning and 
the place they occupy in the system of social relations can be considered to be 
the same (1994:105). 

Apart from whether NSMs represent breaks from previous movements, a 
related question is whether NSMs are new at all. As noted above, NSMs often 
contain elements of the old and new. Eder (1985) sees NSMs as embracing 
two types of phenomena: Cultural movements that “oppose present social life 
. . . [and] political movements [that] challenge modern state domination" (p. 5). 
Mouffe (1984) sees the novelty of NSMs not in the new antagonisms but in the 
“diffusion of social conflict into other areas and the politicization of more and 
more relations" (p. 141). 

However, some have been especially critical of the claims to originality of 
new social movements. Rather than arguing continuity or discontinuity with 
previous movements, Plotke (1990) criticizes the impact of NSM theory on the 
direction of social movements. He argues that accounts of a widespread “new 
social movement discourse" are wrong in crucial ways. Dubious claims about 
the movements, about what they are and should be, are politically significant 
because they are taken seriously both by analysts and by some participants 
within the movements" (p. 81). These “dubious claims” are an outgrowth of 
the theoretical conflict with Marxism, which has left NSM theorists "unpre- 
pared to engage in the theoretical and political debate in the United States, 
where neoconservative interpretations became dominant in the late 1970s and 
1980s" (p. 82). In other words, the debate over what's new is not iust an aca- 
demic polemic; it has real consequences for movements that may leave them 
vulnerable to counterattacks. 


Are Contemporary Movements a Product 
0f the Postindustrial Era? 


Related to the question of “what’s new" is a second issue—whether NSMs are a 
product of the shift to a postindustrial economy. There is disagreement over the 
exact nature of this relationship. Two schools of thought exist—one stresses an 
objective and the other, a subjective class position (see Eder 1985). The objec- 
tive school stresses social structural factors that formed “new” social classes as 
oppositional groups, while the subjective school of thought stresses attitudinal 
changes that have formed like-minded groups. Within the objective school, the 
first variation, what I call the "state intrusion" hypothesis, is a post-Marxist view 
that links the rise of NSMs to the changing requirements of capital accumula- 
tion in the postmodern age. With the advent of a service/technical economy 
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with its emphasis on growth and information management, capital accumula- 
tion necessitates social as well as economic domination. Social domination 
involves controlling dissent and knowledge (ensuring conformity) and there- 
fore requires an expansion of the state’s coercive mechanisms into the civic 
sphere (Habermas 1987, Melucci 1984, Sassoon 1984, Touraine 1971). NSMs 
are concerned with the “self-defense of ‘society’ against the state ... and the 
market economy ..." (Cohen 1985:664). Habermas (1981) refers to the pro- 
cess by which the state and market economy substitute strategic action for the 
symbolic processes of communication as "inner colonization." 

Mouffe (1984) offers a different version. She sees a similar process ex- 
cept that she links it to the commodification of social life (where social needs 
depend on the market for satisfaction), bureaucratization (resulting from the 
intervention of the state into all areas of social reproduction), and cultural mas- 
sification (resulting from the pervasive influence of the mass media that destroys 
or modifies existing collective identities). These new forms of subordination 
are responsible for the rise of NSMs, which represent novel forms of resistance. 

The unique nature of conflict in modern societies is said to be partly a function 
of three characteristics of domination and deprivation. First, the impact of the 
state and economy on society is said to no longer be class specific but “dispersed 
in time, space, and kind so as to affect virtually every member of society in a 
broad variety of ways" (Offe 1985:844). Second, there has been a deepening 
of domination and social control, making its effects more comprehensive and 
inescapable. Third, the political and economic institutions have lost the ability 
to correct their own defects (irreversibility) requiring action from outside the 
official political institutions to correct its flaws (Offe 1985). 

NSMs are believed to be a reaction to the state's attempts to control the civic 
sphere.^ For some, what is unique about NSMs is that they “emerge primarily 
outside the bourgeois public sphere . . . as extra-institutional phenomena rooted 
in civil society ... [that] point to a recovery of civil society” (Boggs 1986:47). 
Thus, we see movements that are concerned with cultural questions (involving 
matters of sexual identification, role definitions, and community). NSMs are 
also reactions to the expansive (growth oriented) nature of postindustrialism, 
which needs to neglect the social costs of growth to maintain profitability. This 
growth orientation has two principal consequences. First, it produces a mass 
consumer culture marked by mega-malls, strip malls, mass advertising, and 
planned obsolescence. Many NSMs reject this form of cultural manipulation. 
Second, the requirements of a mass consumer culture have negative environ- 
mental manifestations that are largely unwelcome and unwanted. These include 
garbage dumps, incinerators, and toxic pollution as well as the exploitation of 


4This should not be confused with the tactical choices of movements, such as the animal rights 
movement, that often attempt to mobilize the state to enter into the civic sphere to attain their goals, 
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the environment for the raw materials of industrial production. Thus, NSMs 
also represent quality of life concerns. Ultimately, the aim of participants in 
NSMs is “not to seize power in order to build a new world, but to regain power 
over their own lives by disengaging from the market rationality of productivism" 
(Gorz 1982:75). 

Some theorists have used the writings of Gramsci to illuminate the processes 
of cultural domination that the state employs to maintain power (see Carroll 
1992 for an overview). This "cultural Marxism” is a critique of “Marx’s con- 
cepts of the relations and forces of production for inadequate attention to the 
conscious experience of institutions and creative practical reasoning" (Weiner 
1982:13). This emphasis on the cultural basis of conflict in the modern era is 
imputed to be one of the defining characteristics of NSMs (Cohen 1983, Eder 
1985, Feher & Heller 1983, Gorz 1982, Melucci 1980, Scott 1990). The ide- 
ological hegemony of the state requires counter-hegemonic actions by social 
movements to dismantle the dominant sccial views that reinforce the legiti- 
macy of the capitalist system (Cohen 1983). Current notions, such as that "the 
personal is political,” that is, that everyday behavior has political ramifications 
and, by implication, supports the hegemony of the ruling classes, reflect the 
emphasis of NSMs on doing battle not only on an economic level but also on a 
cultural level (Mooers & Sears 1992). This “culture conflict" is manifested in 
the life-style emphasis of NSMs and is fought on symbolic and identity levels 
(Kauffman 1990, Weiner 1982). 

The second school of thought concerning the origins of NSMs places their 
cause in the subjective consciousness of the actors. There are two principal 
variants. The first is the "value shift" hypothesis that centers around the larger 
economic, political, and social context of Western nations.” This view states 
that the people of Western nations have reached a point of economic and po- 
litical security in the modern age (where their basic needs for sustenance and 
survival are relatively assured). This confidence led to a shift in public opinion 
(culture) away from issues relating to economic or political security to issues 
of personal growth and self-actualization. The underlying mechanism is based 
on a Maslowian psychological scheme where an individual is first and foremost 
concerned with issues of survival and security; once these basic needs are sat- 
isfied, one is able to move up the ladder to higher-order concerns (Falik 1983, 
Inglehart 1977, 1981). This scheme implies that the socialization experiences 
of various age cohorts, and the conditions of scarcity present during their forma- 
tive years, result in a fixed materialistic or postmaterialistic orientation. Thus, 
NSMs represent the shift to postmaterial values that stress issues of identity, 
participation, and quality of life rather than economic matters. But the value 


SThis is referred to by Poguntke (1993) as a structural change approach See also Kitschelt 
(1988) and Markovits & Meyer (1985). 
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changes are not the only attitudinal changes that result in NSMs. For Inglehart 
(1977), postmaterial values combine with increases in political efficacy and po- 
litical distrust to result in increased political activism. This view says nothing 
about actions of the state except in so much as it provides for the economic and 
political security of its populace. 

The second value change variant concerning the origins of NSMs is the “cycle 
of protests” argument. It states that NSMs are simply recent manifestations of 
a cyclical pattern of social movements (see Tarrow 1983). Some link the cycles 
to "anti-modern or romantic-ideological reactions to functional principles, con- 
tradictions, and alienating effects of modern societies" (Brand 1990:24; see also 
Eder 1982), others to recurring waves of cultural criticisms linked to changes 
in the cultural climate (Brand 1990) or to political and social events. These pe- 
riods are said to act as fertile ground for the proliferation of social movements 
sensitizing the population to the problems of modern societies (Brand 1990, 
Rudig 1988). 

However, the evidence supporting these views is not conclusive. The "state 
intrusion" hypothesis suffers because of the difficulty in empirically estab- 
lishing a link between the actions of the state and incidence of NSMs. Such 
connections can only be drawn by inference, by establishing the meaning and 
intent of actions of the state and hypothesizing about their consequences. "The 
work of interpretation is inevitably risky and less totalizing" (Cohen 1985:665). 
Specifically, the hypothesized changes in the state are vague and difficult to op- 
erationalize, and the mechanism by which actions of the state are linked to 
social movements is not specified. 

The “value shift" hypothesis, on the other hand, is based on empirical ob- 
servations of changes in public opinion over the last 30 years. The evidence of 
a value shift seems compelling. Opinion studies in both Europe and North 
America have chronicled a change in the values of the public that moved 
from economic to non-economic concerns, as is suggested by the hypothesis 
(Inglehart 1977, 1981, 1990, Kaase 1990). However, there are several problems 
with this thesis. 

First, the hypothesis locates the value change in a specific class segment: 
the so-called “new” middle class (Inglehart 1990). However, the “new” middle 
class is defined in widely divergent ways that contribute to inconsistent results 
from various investigations. This is also true of the indicators employed to 
mark the differences between the “new” middle class and other social classes. 
Several formulations of the "new" middle class (Ehrenreich & Ehrenreich 1977, 
Gouldner 1979, Kristol 1972, Ladd 1978, 1979) employed differing definitions 
of the new middle class and produced inconsistent results. Brint (1984) tested 
the four variations by looking at the presence of “antibusiness” and “egalitarian” 
sentiment among the groups as defined by each of the above researchers. He 
found that the four groups defined as composing the "new" middle class did 
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not possess dissenting attitudes. He concluded that "the theorists were simply 
incorrect in [that they] ... exaggerated the levels of dissent and even the levels 
of liberalism found in the new class" (p. 60). Brint's review shows that NSM 
scholars have failed to provide empirical evidence that would allow them to 
conclude with confidence that such a new and militant class, linked to NSMs, 
has arisen in the United States or Europe. 

The second problem with the value shift hypothesis 1s that the cause for the 
value change is imputed to be a function of the increasing security and pros- 
perity of modern Western countries. This connection is less amenable to direct 
empirical verification. Other possible explanations for the rise in postmodern 
values could be offered including the growing bureaucratization of society, or 
that the values are cyclical rather than rooted in the ability of the economic 
structure to provide for the material benefit of the population. 

Third, relying strictly on economic affluence as indicating postmaterialism 
may miss more important determinants of social values. As stated by Cotgrove 
& Duff (1981), "perhaps Inglehart and later researchers have been looking 
at the wrong kind of variables to explain support for postmaterialism. By 
concentrating on the level of affluence of an individual as determinant, they have 
neglected ideals [that is, personal values] as a possible cause" (p. 98). The point 
is that there may be a difference between public and private values and that one's 
public values may be more central to predicting support for postmaterial values. 

Fourth, the two elements of Inglehart's model, the arguments concerning 
scarcity and socialization, lead to contradictory expectations. As stated by 
Boltken & Jagodzinski (1985), 

If the responses to the value index reflect internalized value orientations, they should be 
fairly stable in adulthood. If, on the other hand, respondent behavior 1s affected to a larger 


extent by economic changes, usually no stability can be expected But the same sequence 
of actions cannot be both stable and unstable (p 444), 


The "cycle of protests" argument that ties changes in the cultural climate to 
waves of movements differs from the "value shift" hypothesis in that it adds a 
time dimension to the shift in social mood and makes no assertions about the 
structural source of social movements. There is no significant body of literature 
testing the historical dimension of the argument, although such shifts have been 
noted by other authors (Hirschman 1981, Huntington 1981, Namenwirth & 
Weber 1987, Schlesinger 1986). Brand (1990) has done a comparative study of 
Germany, England, and the United States that shows alinkage between phases of 


6A discussion of new social mcvements in Latin America, by Slater (1985) also indirectly 
refutes the belief that economic prosperity and the concomitant value changes it begets underlie the 
nse of NSMs Latin American nations can hardly be described as capable of satisfying the basic 
needs for economic and physical security of their people. However, without ascertaining whether 
the model provided by NSMs 1s in some manner being diffused, the existence of NSMs in Latin 
America cannot be used as definitive refutation of the value change hypothesis 
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cultural criticisms and the manifestation of new middle-class radicalism (social 
movements) from 1800-1990. Pichardo (1993) presents a similar argument for 
the Populist/Progressive era movements in the United States, which demonstrate 
striking similarities to the rise of NSMs. The middle-class radicalism of that 
period 1s linked to the advancing restrictions on community space brought about 
by industrialism. But these data are too preliminary to be convincing. 

However, in terms of whether NSMs are a product of the postindustrial age, 
the cycles-of-protest argument implicitly responds in the negative. For if the 
values are tied to the rise of postindustrial society or linked to the rise of the new 
middle class, then they could not have been present before. Brand’s formulation 
may be a way to reconcile the two views, but that would require the NSM thesis 
to be substantially altered. 

Yet another strain of critique casts doubt on the connection between the larger 
macro-historical societal changes associated with postindustrialism and NSMs. 
Olofsson (1988) argues that the “cultural revolutionary activities ... [of NSMs] 
can be articulated with very different politico-ideological formations, social 
groups, and classes” (p. 31). In other words, there is no necessary connection 
between postindustrialism and the project of NSMs. 

Similarly, the “cycles of protest” argument has an alternative explanation.’ 
Rather than seeing them as a consequence of changing values associated with 
modernization, which in turn dictate tactical options that are in concert with the 
values, another point of view suggests that changes in the tactical repertoire of 
movements are independent of the values and goals expressed by the movement. 
Rather than linking changes in tactics to new values and goals, they are to be 
seen more in terms of an evolving interaction between the agents of repression 
and movement actors where new tactics are an outcome of the ability of agents of 
repression to accommodate to the old tactics, thereby rendering them ineffective 
(Koopmans 1983). New tactics are a response to the need to find new forms of 
effective tactics. Another explanation for the waves of protest argues that they 
are the outcome of competition between movement organizations (Tarrow 1989, 
1991). In this view, the competition for resources and membership between 
movement organizations leads to innovation, militancy, then decline. Tarrow 
(1991) further argues that those who argue for the “newness” of contemporary 
movements have simply mistaken an early phase of movement development for 
a new historical stage of collective action. 

Finally, the NSM thesis limits the phenomenon to Western nations, yet some 
authors have attempted to extend the thesis to underdeveloped countries. Slater 
(1985), in an interesting turn on the NSM thesis, relates Latin American contem- 
porary movements to excessive centralization of decision-making power, the 
state’s incapacity to deliver adequate social services, and the eroding legitimacy 


7 This 1dea was suggested by Bert Klandermans 
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of the state." But the mere presence of NSMs in non-Western nations argues 
against both hypotheses—of state intrusion (because NSMs are triggered by the 
change to a postindustrial economy, a change not occurring in Latin American 
nations) or of value change (because the value changes are a product of the 
economic and physical security of a country’s population, which also cannot 
be said to be characteristic of Latin American nations). 

What remains is of questionable value as a paradigm or theory. The rela- 
tionship between structure and the rise of contemporary movements is at best 
uncertain, and most of the so-called unique characteristics and features are not 
unique. What seems to be unique is their ideological (identity) orientation—the 
one hypothesized characteristic that seems truly to mark a break from the past. 


SUMMARY 


So what have we? There is significant doubt in terms of whether contemporary 
movements are specifically a product of postindustrial society. The mechanisms 
cited within the NSM literature disagree significantly, depending on the varia- 
tion. In turn, each of these variations has significant weaknesses—inconclusive 
empirical support, questionable operationalization of variables, ambiguous or 
abstract concepts, and feasible alternative explanations. The same could be 
said for whether contemporary movements represent anything unique. Except 
for the issue of identity, the so-called unique characteristics of contemporary 
movements are not unique at all. At best, it can be argued that they are recent 
additions to the repertoire of social movements. But changes in repertoires 
have been noted by other authors (most notably Tilly 1979) without employing 
explanations that call for new theories. 

The principal contributions of the NSM perspective are its emphases on iden- 
tity, culture, and the role of the civic sphere—aspects of social movements that 
had been largely overlooked. The failure to attend to identity issues was rooted 
in the belief that social movement participation was instrumentally based. In 
fact, Parkin (1968) believes that the expressive dimension of participation may 
be a feature of social movements dominated by the middle class. The expres- 
sive nature of participation is linked to the cultural aspects of movements as the 
goal of expressive action is guided by a particular moral outlook concerning 
the appropriate normative order. The civic sphere, where culture resides, which 
had traditionally been seen as being dominated or determined by the economic 
sphere, is now seen as a locus of social protest. This "liberation" of the civic 
sphere has brought to focus the realization that the civic sphere is an area of 
contention just as are the economic and political spheres. 


5 Slater employed Mouffe’s commodification variation of the state-intrusion hypothesis, which 
is not linked to the capacity of governments to provide for the physical and economic security of 
its populace 
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However, the need for a new theory of social movements to account for 
these differences is uncertain. Not that social movement theory (in particular 
the resource mobilization theory) presented a complete account of movement 
dynamics (see Canel 1992, Klandermans 1986, Klandermans & Tarrow 1986). 
But a modification of theory that attended to the “why” of movements in addition 
to the “how” would seem to be a better direction to proceed. 


CONCLUSION 


The inability of the NSM school to adequately defend its most central propo- 
sitions argues against labeling or referring to the NSM thesis as a paradigm or 
theory about contemporary movements. Nor can simply arguing that the NSM 
thesis applies only to left-wing movements of the contemporary era salvage its 
image as a theory. Even limiting the thesis to oppositional movements (which 
traditionally come from the left) is insufficient as many conservative movements 
can also be so characterized. The added inability to argue convincingly for a 
typology of contemporary movements or a link to changes in sociceconomic 
structures leaves the NSM thesis untenable as a theory. 

Does that mean that there is nothing behind this school of thought? Is it, as 
Tarrow (1991) believes, that researchers mistook an early phase of movement 
development for a new historical stage of collective action, or did something 
genuinely unique take place? The principal question is not whether contem- 
porary movements display unique characteristics, for the tactics and styles of 
movements are often a function of expediency rather than principle and thus are 
guided by the utilitarian logic of achieving goals. It1s more a matter of whether 
contemporary movements are reacting to the changing nature of domination in 
the postindustrial world. Put another way, are contemporary movements unique 
because of the unique character of domination necessitated by the emergence 
of the postindustrial era? 

In my opinion, any rescue of the NSM thesis must begin with an inclusion of 
contemporary conservative movements both presently and historically. With 
such an inclusion the process of government intrusion into the civic sphere 
would be more clearly highlighted as conservative mobilizations are also re- 
acting to the actions of governments to control the civic sphere. The religious 
values that underpin the ideological structure of many conservative movements 
must be seen as an additional source of friction that has always resided in 
and traditionally dominated the civic sphere. With the separation of church 
and state that took place in the eighteenth century, religious groups lost their 
state-sanctioned privileged position within the civic sphere.” Many conservative 


* This was because the coercive mechanisms of the state were no longer available to religious 
groups to sanction wayward believers Thus, the cultural behavioral codes preferred by religious 
groups could no longer be guaranteed to dominate the civic sphere 
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mobilizations in the United States of the eighteenth and nineteenth centuries 
can be understood as attempts by religious groups to coerce the state to enforce 
behavioral and moral codes consistent with their beliefs. Thus, the evolving 
history of the civic sphere and the social and political conflicts involving the 
civic sphere need to be detailed. In this way, we can better understand the 
nature of modern conflict and the role of the civic sphere in generating and 
maintaining that conflict. It is the observation concerning the role of the civic 
sphere (where culture and identity reside) in modern conflict that is perhaps 
the most provocative and informative aspect of the NSM thesis and the element 
around which a reformulation of the NSM thesis should be constructed. 
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ABSTRACT 

This chapter critically examines the hypothesis that women's rising employment 
levels have increased their economic independence and hence have greatly re- 
duced the desirability of marriage. Litt:e firm empirical support for this hypoth- 
esis is found. The apparent congruence in time-series data of women's rising 
employment with declining marriage rates and increasing marital instability is 
partly a result of using the historically atypical early postwar behavior of the 
baby boom era as the benchmark for comparisons and partly due to confounding 
trends in delayed marriage with those of nonmarriage. Support for the hypothesis 
in multivariate analyses is found only in cross-sectional aggregate-level studies, 
which are poor tests of an individual-level behavioral hypothesis and which also 
present difficulty in establishing the appropriate causal direction. Individual-level 
analyses of marriage formation using longitudinal data and hazard modeling uni- 
formly fail to support the hypothesis, while analyses of marital dissolution yield 
mixed results. Theoretically, the hypothesis also has severe limitations. The 
frequent tendency to equate income equality between spouses with women's eco- 
nomic independence and a lowered gain to marriage fails to distinguish between 
situations where high gains to marriage may be the result of income equality from 
situations where the result is a very low gain to marriage. Focusing on income 
ratios alone also tends to distract attention from the underlying causes of these 
ratios and their structural determinants. Finally, the independence hypothesis is 
based on a model of marriage that views the gain to marriage as a result of gender- 
role specialization and exchange. Historical evidence on the family indicates that 
this is a high risk and inflexible family strategy for independent nuclear families 
and one that is in strong contrast to contemporary family patterns. 
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INTRODUCTION 


The economic role of American women, especially married women, has been 
undergoing a major transformation since World War II. These changes have 
inspired a large and varied literature, some of it focusing primarily on women's 
labor-market status and some on the familial implications of women's expanded 
work roles. This chapter emphasizes one facet of the latter body of work: 
the hypothesis that women's rising employment has led to a decrease in the 
desirability of marriage and hence is responsible for what is increasingly being 
called the "retreat from marriage” or, alternatively, the “decline of marriage.” In 
the pages below I critically examine both the theoretical basis of this hypothesis 
and the empirical research it has inspired. 

A long-standing and influential tradition in the social science literature is 
the importance of differentiated sex roles for a stable marriage system. In the 
early postwar period this idea was emphasized by Talcott Parsons (1949), who 
argued that sex-role segregation was a functional necessity for marital stability 
and even for the viability of society itself. More recently, this same theme has 
been elaborated somewhat differently in the economic theory of marriage by 
Gary Becker (1981). In an argument very similar to one Durkheim (1960) made 
over a century ago, Becker maintains that the major gain to marriage lies in the 
mutual dependence of spouses, arising out of their specialized functions—the 
woman in domestic production (and reproduction), the man in market work. 
Marriage then involves trading the fruits of these different skills. In response to 
economic growth and the rising wages it produces, however, women's market 
work also rises. The result is that women become less specialized and more 
economically independent, leading, in turn, to a decline in the desirability of 
marrying or of staying married. 

Not all scholars necessarily agree with Becker's argument in its entirety. 
Nevertheless, an economic independence argument of one sort or another has 
had wide appeal among sociologists as well as economists and is currently one 
of the major contenders among attempts to explain recent marriage and fam- 
ily trends in the United States (Ross & Sawhill 1975, Cherlin 1979, 1992, 
Preston & Richards 1975, Waite & Spitze 1981, Fuchs 1983, Espenshade 
1985, Goldscheider & Waite 1986, Farley 1988, Schoen & Wooldredge 1989, 
McLanahan & Casper 1995). One reason is that the model seems to have 
tremendous face validity. The notion of specialized sex roles fits in well with 
our views of how the traditional family historically functioned, and the rapid 
changes in marital behavior appear to have followed very closely upon the rapid 
rise in married women's employment, especially that of young women. More- 
over, the theory has the elegance of simplicity; yet it can apparently explain 
a wide variety of complex changes and differentials in marriage and family 
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behavior—from delayed marriage to nonmarriage, marital instability, nonmar- 
ital cohabitation, female-headed households, declining fertility, and so forth. 

This review examines several aspects of the research on the hypothesis that 
women’s increasing market work has discouraged marriage formation and en- 
couraged marital instability, hereafter referred to as the “independence hypoth- 
esis.’ The discussion deals first with the work relevant to the empirical status 
of the hypothesis and then critically examines the underlying theory—the spe- 
cialization and trading model of marriage. 


EMPIRICAL ISSUES 


The following discussion investigates four major aspects of the empirical anal- 
ysis of the independence hypothesis. The first examines how well simple time 
series data match the patterns predicted by the theory. The second investigates 
whether the marital behavior predicted by the theory is what is actually be- 
ing observed. Next we consider the result of multivariate tests of the theory, 
distinguishing between aggregate- and individual-level analyses. 


Evidence from Time Series 


WORK AND MARITAL BEHAVIOR The apparently close correlation between the 
trends in married women’s work and marriage behavior is cited by many schol- 
ars as providing general support for the independence hypothesis (Becker 1981, 
Davis 1985, Espenshade 1985, Farley 1988, Cherlin 1992, McClanahan & 
Casper 1995). And the changes are very impressive. On the one hand, al- 
though married women’s employment had been slowly rising throughout the 
twentieth century, this shift was greatly accelerated in the postwar period. Most 
importantly, starting in the 1960s, the rise in the labor force participation of 
young married women really accelerated, and since then the age pattern of their 
employment has increasingly resembled men’s, with proportions approaching 
70% for women into their mid-fifties (Oppenheimer 1994). On the other hand, 
dating from the late 1960s, very substantial changes have occurred in marriage 
and family behavior. As a consequence, by the 1990s, age at marriage was 
much later than in the 1960s, and the proportions of those never marrying may 
be rising as well. For example, the proportions of women who were never 
married by age 25 rose from 14% in 1960 to 42% in 1990 (US Bureau of the 
Census 1991). All this is especially true for African Americans. For all, marital 
instability has also become much more prevalent, and although remarriage rates 
may be high, they are lower than in the past; female-headed families have been 
on the rise, again especially for African Americans. All these changes in mar- 
ital and family behavior, on the one hand, and married women’s employment 
behavior, on the other, tend to create the presumption that there must be a major 
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causal connection between the two, and the economic independence argument 
seems to provide one important possible explanation. 

Impressive as the time-series data are, however, their support of the inde- 
pendence hypothesis may be more apparent than real. The problem is that 
if one follows the historical series back in time—before the 1950s—marriage 
and family behavior are no longer so nicely correlated with trends in women's 
employment. Central to the hypothesis is the idea that when the traditional 
family prevails, characterized as it is by marital sex-role specialization, and the 
gain to marriage is presumably high, age at marriage will be young, marriage 
will be universal, and marital instability low. However, age at marriage was 
by no means historically low in American society, any more than it was in 
Western European societies (Cherlin 1992, Oppenheimer 1994, Hajnal 1965, 
Goldstone 1986, Banks 1954). Age at marriage seems to have been quite young 
for American women around 1800 but subsequently rose and was relatively 
high and quite variable for late nineteenth century cohorts (Sanderson 1979, 
Thornton & Rodgers 1983). Age at marriage, as well as its variability, then 
declined, reaching a low for the cohorts marrying in the early post- World War 
II period. Since then, it has risen again to the levels exhibited by early twentieth 
century cohorts; in fact, as Cherlin (1992) points out, it appears that women's 
age at marriage has now risen even above that of late nineteenth century cohorts. 
However, this is probably not too significant since, compared to the present day, 
the delayed marriages of late nineteenth century cohorts existed despite much 
higher proportions of rural dwellers whose age at marriage was relatively young 
(Taeuber & Taeuber 1958). 

In short, the early postwar period did not represent "typical" traditional mar- 
riage timing patterns. On the contrary it was most atypical in its young age at 
marriage and low variability in marriage timing. Nevertheless, that period has 
become the benchmark against which all subsequent behavior has been com- 
pared. More historically typical was a tendency for age at marriage to fluctuate, 
probably in response to changing circumstances, and long before one could es- 
tablish a major causal role for women's economic independence (Thornton & 
Rodgers 1983, Oppenheimer 1994). In fact, age at marriage was dropping in 
the first half of the twentieth century at the same time that single women's 
market employment was rising (Goldin 1990). 

With regard to marital instability, although the proportion of all marriages 
ending in divorce has risen substantially since the 1950s and 19605, the trend 
toward increased marital instability is of long-standing in American society and 
greatly predates the postwar rise in married women's labor force participation 
(Preston & McDonald 1979). There is also some evidence that part of the 
rise in marital instability may be more apparent than real—at least for African 
Americans. Thus, in an analysis of the 1910 census public use sample, Preston 
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and his colleagues (1992) found evidence that, high though mortality was, 
there was also a considerable over-reporting of widowhood among African- 
American women with children, indicating that there was much more marital 
or union instability in this period than was directly reported in the census or 
than is commonly believed to be the case. Some evidence also suggests that 
the long-term increase in marital instability has leveled off and could possibly 
be reversing itself (Martin & Bumpass 1989, Schoen & Weinick 1993), but it 
is too soon to determine this conclusively, especially given the possible role of 
business-cycle fluctuations and shifts in the proportion of separations ending in 
divorces. However, the divorce rate peaked in 1979 at 22.8 per 1000 married 
women aged 15 years and over and has declined gradually since then, reaching 
19.8 in 1995, the lowest rate since 1974 (National Center for Health Statistics 
1990, 1995). Given the well-documented tendency of marriages formed at a 
young age to be unstable, the rise in delayed marriage may be contributing to 
the declines in marital instability. 

One marriage-related trend that is of fairly recent origin is the rapid rise in co- 
habitation in the United States. Even descriptive data on cohabitation was very 
poor until the 1987—1988 National Survey of Family and Households provided 
valuable retrospective data on cohort shifts in cohabitation behavior (Bumpass 
& Sweet 1989). That study revealed that although only 3% of the 1940-1944 
cohort of women had cohabited before age 25, 37% of the 1960-1964 cohorts 
had done so. However, the extent to which this enormous growth in cohabita- 
tion represents a "retreat" from marriage greatly depends on the extent to which 
cohabitation is a substitute for marriage or a stage in the courtship process. In 
this respect, cohabitation appears to be a heterogeneous phenomenon. An in- 
creasing proportion of first marriages start as cohabitations—Trising from 9% 
for the first marriages formed in 1965—1974 to 39% for the 1980-1984 cohorts. 
Moreover, of first cohabitations started in the 1975—1984 period, 56% turned 
into marriages by the end of the fifth year. Hence, the rapid rise in cohabitation 
should be contributing to marriage delays, but for a substantial proportion of 
cohabitators it does not signify a rejection of marriage. In fact, Bumpass and 
his colleagues (1991) found that most of those currently cohabiting expected to 
marry their partners. Moreover, in his event-history analysis of young women's 
marriage formation from the NLSY (National Longitudinal Survey of Youth), 
Lichter (1992) found that the odds a woman married in a year were 6696 higher 
1f she was cohabiting at the previous interview. Hence, here cohabiting seemed 
to be operating as a proxy for being engaged. 

In sum, trends in marital instability ^ad their inception long before the rapid 
rise in married women's employment started. In the case of marriage formation, 
the historical record is characterized by considerable fluctuations, indicating a 
responsiveness to changing circumstances rather than the pattern of early and 
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universal marriage supposed to be characteristic of the specialization model 
of marriage. And while cohabitation is a relatively recent phenomenon, for a 
substantial segment of those who cohabit it obviously represents a stage in the 
courtship process rather than a retreat from marriage. In short, these patterns 
raise serious doubts about putting too much reliance on a truncated time series’ 
apparent support of the independence hypothesis. 


DELAYED MARRIAGE OR NONMARRIAGE? ‘The independence hypothesis is es- 
sentially a theory of nonmarriage, for it is arguing that if, by their own en- 
deavors, women can achieve approximately the same income as a prospective 
spouse, there is not much to gain by marrying and specializing in home pro- 
duction. However, to date, there has been little effort to articulate how this 
argument could lead to delayed instead of nonmarriage. If most of the changes 
we are observing are shifts in marriage timing rather than in nonmarriage, then 
the independence hypothesis is not really highly relevant. It will, in fact, be 
garnering far more empirical support than it analytically deserves if there is 
a serious confounding of delayed marriage with nonmarriage in the statistics. 
Moreover, a considerable delay in marriage may, even on its own, promote some 
nonmarriage, especially for women whose marriage-market position appears 
to deteriorate with age (Goldman et al 1984, Watkins 1984). 

Aside from the confounding problem, nonmarriage and delayed marriage are 
two rather different phenomena. There are many reasons why varying numbers 
of people may want to or feel compelled to delay marriage but still wish to 
marry eventually—i.e. they still see a major gain from marriage. For example, 
economic factors, school enrollment, service in the military, getting established 
in a career, and so on, may all signify the necessity or advantage of delaying 
marriage without affecting the desirability of marriage per se. Hence, theories 
designed to explain nonmarriage may not be very pertinent if much of what 
is really happening is delayed marriage. Moreover, the particularly late age at 
which late marriers marry in a period of delayed marriages greatly increases 
the difficulty of interpreting whether or not currently observed trends signify 
a considerable rise in nonmarriage. This is especially the case for men, who 
usually marry later than women. 

So far, the evidence indicates that, while there is considerable evidence of 
delayed marriage, nonmarriage will not rise markedly for white women and 
will remain at or below the proportions for late nineteenth century cohorts. 
For example, various estimates made in the 1980s suggest that about 9% of 
white women born in the early 1950s would never marry, up from the low 
of 596 for the cohorts born during the 1920s but below the 1296 for the 1880 
cohort. Marriage is so delayed among African Americans that predictions 
of the proportion never marrying are much riskier; however, these estimates 
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have ranged from about 25% to 30% (Rodgers & Thornton 1985, Bennett et al 
1989). In sum, the independence argument may still be potentially useful in 
explaining the apparently sharp rise in nonmarriage among African Americans. 
However, unless the hypothesis can be made more explicitly relevant to the 
question of delayed marriage, it can explain little of the observed trends in 
marriage formation for white women despite the enormous postwar changes in 
their labor-market behavior and, more recently, in their marriage behavior as 
well. 


Multivariate Analyses 


The use of time series data in thé investigation of the role of women's rising 
employment in changing marital behavior has been largely limited to illustrat- 
ing the similarity among the two trends. À more rigorous approach involves 
the use of multivariate techniques to try to establish the causal connection be- 
tween women's labor-market behavior and marital behavior. Basically two 
types of multivariate approaches have been used to test the independence hy- 
pothesis, each of which is usually applied with a different type of data. One is 
aggregate-level analyses, which typically use census data; the unit of analysis 
is a geographic area such as a metropolitan area or a labor market area. The 
second and larger group of studies consists of micro-level analyses of longitu- 
dinal data, where the individual is the unit of analysis. Micro-level analyses 
generally use samples that have longitudinal individual-level data, either of a 
retrospective nature or because they are panel studies, or they may include a 
combination of both sources of life-history data. A recent development is also 
to supplement the micro-level data with aggregate-level information in the form 
of contextual variables in order to provide information on the characteristics of 
the respondent's marriage market (Lichter et al 1992, Lloyd & South 1996). 


AGGREGATE-LEVEL ANALYSES The major support for the independence hy- 
pothesis in the regression analyses of marriage formation has come from ag- 
gregate level analyses using census data (Preston & Richards 1975, Lichter 
et al 1991, Fossett & Kiecolt 1993, and McLanahan & Casper 1995). The 
dependent variables in these studies are all prevalence measures—for example, 
the proportion of women who are currently or recently married. The explana- 
tory variables relevant to the independence hypothesis are usually measures 
of employment and of economic status—typically, earnings. Some also have 
educational attainment data, useful as an indicator of long-term labor-market 
position. In general, all these studies have found that earnings (or an SES 
indicator in the case of the Fossett & Kiecolt study on African Americans), em- 
ployment, and schooling (where included) had a negative impact on the marital 
status composition of an area. 
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While these results appear to support the independence hypothesis, aggregate- 
level analyses of marriage timing have several serious drawbacks. One is that 
the hypothesis itself refers to individual-level behavior, and testing it with 
aggregate-level data can lead to misleading results because the same factors 
that produce area-level differences in the prevalence of married people do not 
necessarily produce the same kind and level of individual differences in the 
incidence of marriage. Moreover, it is not at all unusual to get opposite-sign 
results from macro- and micro-level analyses of approximately the same phe- 
nomenon, leading to what has been called the “ecological fallacy” (Robinson 
1950, Achen & Shively 1995). 

One reason why aggregate-level tests of the independence hypothesis may be 
misleading comes from the difficulty of establishing causal ordering between 
marital status and economic behavior by using cross-sectional census-type data. 
One cannot assume that the economic behavior and characteristics of women 
are necessarily the determinants of marriage characteristics because, in many 
circumstances, they may be the consequences. One of the usual "solutions" 
to the causal ordering problem in social research is to try to establish the ap- 
propriate time order of the variables. However, with census data, most of the 
variables are measured at the time of the census or, at best, during the previous 
year. Moreover, the dependent variable 1s a prevalence rather than an incidence 
variable. The proportion who are currently married in any particular area is a 
residual of those who were married at some unspecified time prior to the census 
and who did not separate, divorce, or become widowed before the census or, if 
they did, had remarried. And for an unknown number of women, none or only 
a few of these events may even have occurred in the area in which they were 
living at the time of the census.! In short, the proportion married in an area at 
the date of a census is a complex variable, resulting from a number of processes 
that have occurred over an unspecified length of time and in unspecified loca- 
tions, making it very difficult to determine time order or the spatial connections 
essential to establishing causal connections. 

One recent analysis well illustrates some of these problems. Using metropoli- 
tan areas in the 1970-1990 censuses as the unit of analysis, McLanahan & 
Casper (1995) regressed the percentage of women aged 25-29 who were mar- 
ried on, among other variables, the percentage of women, also aged 25-29, who 
were working full-time year-round. They found that employment experience 
had an apparently large negative effect on marital status and interpreted this as 
support for the independence hypothesis. However, the true causal direction is 
very likely to be from marital-status composition to employment composition, 


lThus in their analysis of labor market areas (LMA) in the 1980 census, Lichter and his col- 
leagues (1991) found that, on average, 33% of currently married women, aged 20-29, had in- 
migrated into the LMA in the previous five years 
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rather than the reverse. Married women in the 25-29 age group are in the midst 
of their reproductive period and likely to have young children; we also know 
from individual-level data that such women are less likely to work year-round 
full-time than other women in that age group. For example, in 1990, for the 
United States as a whole, although 68% of mothers of children under six had 
some work experience during the previous year, only 28% worked year-round 
full-time (Bianchi 1995, p. 125). Moreover, married women, especially those 
with young children, were even less likely to be so employed in previous cen- 
suses. Hence, if there is a higher proportion of women aged 25~29 who are 
married in some areas rather than others or married with young children, the 
proportions working year-round will be greatly depressed in those areas. Simi- 
larly, metropolitan areas with a higher proportion of single or separated/divorced 
women are likely to have considerably higher proportions working year-round 
full-time because these women usually have to work to support themselves. In 
sum, given the difficulty of unambiguously establishing causal ordering in such 
analyses, combined with a frequently strong argument for a causal direction 
that is the reverse of that hypothesized, aggregate-level analyses of this nature 
offer a poor empirical test of the independence hypothesis. 

However, an entirely different approach from the regression analyses dis- 
cussed above has been developed by Schoen (1988), and it comes up with 
rather different conclusions. Schoen's approach is basically a marriage market 
analysis and has the advantage of including both sexes and hence can distin- 
guish between the effect of changes in the availability of partners with certain 
characteristics and the desirability of these partners—what Schoen has called 
the “force of attraction.” Using Current Population Survey data, Qian & Preston 
(1993) applied this model to changes in white marriage behavior from 1972- 
1979-1987 by age and educational attainment. Among other things, they found 
that marriage propensities rose among college educated women over age 25 in 
the 1979—1987 period, despite a decline in the availability of college educated 
men in the appropriate age groups, indicating that women in the presumably 
most favorable labor-market position were not using their economic indepen- 
dence to avoid marriage and were sufficiently attractive marriage prospects that 
they were able to overcome a marriage squeeze situation.? In addition, Qian 
& Preston observed a sharp rise in the “force of attraction" for older women 
compared to younger between 1979 and 1987, suggesting that delayed rather 
than nonmarriage was occurring. 


2These findings were in strong contrast to a study by Bennett & Bloom that was widely reported 
1n the popular press but never published in a peer-reviewed journal They estimated that college- 
educated white women who were still single at age 30 would have only a 2096 chance of ever 
marrying. For a discussion of the popular and scientific controversy surrounding these estimates, 
see Cherlin 1990 
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MICRO-LEVEL ANALYSES In contradistinction to the results from aggregate 
level analyses, multivariate studies of the marriage formation behavior of in- 
dividuals have rarely provided any support for the independence hypothesis. 
Most of these studies utilize longitudinal data and were thus able to establish 
time order between the explanatory and dependent variables; hence they can 
make a better case for causal ordering, although time order is not always a reli- 
able criterion. Most also are event-history analyses of one sort or another and 
hence are able to avoid the inherent selectivity biases that used to characterize 
analyses of the age at marriage of samples of relatively young ever-married 
people (Yamaguchi 1991). 

While the independence argument would lead to the prediction that, net of 
school enrollment, more educated women should be more economically inde- 
pendent of marriage, these micro-level regression analyses show that they have 
a higher rather than a lower propensity to marry (Cherlin 1980, Goldscheider 
& Waite 1986, Mare & Winship 1991, Lichter et al 1992, Oppenheimer & 
Lew 1995, Oppenheimer et al 1995). Moreover, there is some evidence that 
the positive effect of schooling on marriage formation has even been increas- 
ing over time (Oppenheimer et al 1995). Educational attainment can also be 
an important factor in remarriage. Using the National Survey of Family and 
Households (NSFH), Smock (1990) found that although, compared to white 
women, African Americans generally had a much lower likelihood of remar- 
riage, schooling had a very strong positive effect on remarriage probabilities 
for African American women whereas it had little impact for whites. 

With regard to the effects of employment, the findings also do not support the 
independence hypothesis (Cherlin 1980, Goldscheider & Waite 1986, Bennett 
et al 1989, Lichter et al 1992, Oppenheimer & Lew 1995). One exception 
was Mare & Winship's 1991 micro-level analysis of 1940—1980 census public 
use sample data, which found a generally negative effect of employment on 
marriage formation (for white women). However, the employment variable 
used was constructed rather than observed and hence may not be an entirely 
unbiased indicator. Women's earnings are also almost always found to have 
a positive effect on the likelihood of marriage, again the reverse of what is 
predicted by the independence hypothesis (Goldscheider & Waite 1986, Mare 
& Winship 1991, Lichter et al 1992, and Oppenheimer & Lew 1995). More- 
over, the Oppenheimer & Lew study found that the positive effect of earnings 
only showed up for women in their mid-to-late twenties rather than among 
younger women, for whom no effect was observed—a pattern that is the op- 
posite of what would be true if greater economic independence increased the 


Because it is generally 1mpossible to determine the time order of variables ın the census, Mare 
& Winship estimated "expected" employment but the variables used to develop these predictions 
were also measured in the same year as the marriage occurred. 
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likelihood of never marrying. Oppenheimer & Lew also looked at the effect of 
occupational level to see whether women in professional and managerial occu- 
pations would be any less likely to marry than those in lower-level white-collar 
occupations—traditionally more typical women’s occupations that might be 
more easily combined with marriage. They found no significant effect except 
for a negative one for unskilled workers. One micro-level panel study found 
no effect of weekly earnings (Teachman et al 1987); however, only weekly 
earnings in the first week in October of each year were available to use, and 
this may have distorted the results. 

A characteristic of virtually all these micro-level analyses of marriage forma- 
tion is that they use employment and earnings data of the recent past—typically 
referring to the time of the interview or the year just before each year at risk. 
This raises several conceptual and measurement problems. First, employment 
has been both the statistical and social norm for single out-of-school women 
for over 50 years (Goldin 1990). Aside from the nonemployment that can 
be attributed to business cycles, those who are not employed should therefore 
be a highly select group. Hence, it is unclear what the employment status of 
single women is really measuring; it certainly is not distinguishing between 
“traditional” and “nontraditional” women. Second, if what we want to get at 
is the likely long-term labor-market position of women to assess their “inde- 
pendence” of the marriage state, the recent earnings or employment patterns 
of respondents may be very poor indicators of this, especially given the low 
earnings and employment instability characteristics of young people, males as 
well as females. Hence, it is not too surprising that these variables have no 
negative impact on marriage formation. Probably the best indicator of long-run 
labor-market position used to date is educational attainment; however, this too 
has not exhibited a negative effect on women’s marriage formation. 

In sum, several aggregate-leve] studies found that women’s education, em- 
ployment, and earnings were negatively related to the proportions married in 
an area. However, studies of this type generally have serious methodological 
drawbacks as a vehicle for the analysis of individual-level phenomena and are 
usually incapable of establishing causal ordering. Moreover, life-history anal- 
yses at the more appropriate micro-level, and which typically employ more 
sophisticated methodologies, have found that similar indicators of labor market 
position have either little effect or a positive one on marriage formation. These 
results may reflect offsetting factors at work and hence do not necessarily mean 
there is no such thing as an independence effect. However, they do indicate 
that it is unlikely that the independence effect is the driving force behind recent 
trends in marriage formation. 

The analysis of marital instability also provides an opportunity for testing the 
independence hypothesis. Moreover, if one has information on both partners, 
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it is possible to study not only whether couples with higher-income wives are 
more likely to separate but, in addition, whether women’s relative earnings 
are an important factor in marital stability. Given the availability of data on 
husbands, therefore, most panel studies during the past 25 years have included 
both husbands’ and wives’ economic behavior in their analyses of the effect 
of wives’ employment on marital instability. Generally, the problem has been 
formulated in terms of “income” and “independence” effects (Ross & Sawhill 
1975, Cherlin 1979). The idea is that a higher income (whatever the source) 
improves the quality of family life, thereby contributing to family stability. 
Since the husband has typically been the source of most of the family’s income, 
his labor-market position should have an important effect on marital stability. 
Husband’s economic position has usually been conceptualized in terms of both 
employment characteristics and earnings in recognition of the possibility that 
it is the stability of a man’s income as well as its size that may affect marital 
outcomes. A wife’s earnings can also have an income effect, but offsetting 
this is the hypothesized independence effect of her earnings as well as of other 
sources of income such as AFDC. 

By and large, the results of investigations of the independence and income 
effects on marital stability have been mixed. Some studies have found evi- 
dence of an independence effect (Ross & Sawhill 1975, Hannan & Tuma 1978, 
Cherlin 1979, Moore & Waite 1981, Tzeng 1992), whereas others have come 
up with negative results (Bumpass, Martin & Sweet 1991, Hoffman & Duncan 
1995,* Greenstein 1990, 1995, South & Lloyd 1995, and Tzeng & Mare 1995).° 
There are a number of possible reasons for these conflicting findings, including 
differences in the data sets analyzed, variations in the conceptualization of the 
problem and in the variables included in the model. A few issues are important 
to mention, however. 

Analyses of marital dissolution are best conducted by taking the marriage at 
its start and studying the factors affecting it over several years’ duration. Oth- 
erwise, the sample is increasingly biased across marital durations due to the se- 
lective withdrawal of the more divorce prone and those who may have dissolved 
their marriages for the very reasons the investigator is studying. However, stud- 
ies using many of the earlier data sets were hampered by serious left-censoring: 


*The generalizability of this study 1s somewhat limited, however. The major interest of the re- 
searchers was to measure the influence of welfare benefits on marital disruptions; hence they limited 
the sample to couples with children. The analysis may therefore underestimate an independence 


effect, should it exist. 
*Tzeng & Mare’s results were actually somewhat mixed They found that the combined income 


of the couple had no effect on the odds of a marital disruption and neither did the difference between 
the earnings of the partners—all of which failed to support the independence argument However, 
an increase in the wife's earnings over the marriage did have a small positive effect on the likelihood 
of a mantal disruption 
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i.e. marriages of varying durations were picked up in the sample and, although 
followed from that point on, it was not possible to reconstruct the entire history 
of a marriage and most particularly the history of the explanatory variables be- 
fore the panel started. This was the case with the PSID (Panel Study of Income 
Dynamics) used by Ross & Sawhill (1975) and the NLS panel of women aged 
30—44 at the first interview (Cherlin 1979). Hence, there is bound to be a cer- 
tain amount of selectivity bias in these samples, and it is difficult to determine 
how this affected the results although it probably biases coefficients down- 
ward. Moreover, the methodology of analyzing longitudinal data has greatly 
improved in recent years and now usually involves hazard modeling of one sort 
or another. Except for the pioneering methodological work of Hannan & Tuma 
(1978), these methods were not yet generally available for the earlier studies, 
and it is these studies that have tended to show that women's market position 
is likely to lead to marital disruption. 

Two additional factors may produce different results among studies, and 
these refer more to how the independence effect is conceptualized and mea- 
sured. Some studies use absolute income while others use some measure of 
relative income. While the hypothesis implies that the relative earnings of 
wives might be an important factor in the gain to marriage and hence should 
be directly modeled, using the ratio of wives' to husbands' income alone can 
lead to ambiguous results. Thus, one study found that the higher the ratio of 
wife's/husband's income, the greater the likelihood of a separation (Cherlin 
1979). But there are two very different explanations of why relative income 
might be higher for some families than others. One is that the wife's earnings 
in some families are higher compared to others. reflecting a better labor-market 
position and, consequently, indicating a greater financial independence of her 
husband. However, the ratio could also be high because the husband's earnings 
are low compared to other men, while the wife herself may be also in a very 
weak labor-market position. This latter situation does not really seem to fit the 
original conceptualization of the independence effect. Furthermore, given the 
sex differentials in earnings, husband's earnings are likely to be quite low when 
the ratio is high indicating that the "independence" and "income" effects may 
be confounded. Hence, the income of the husband also needs to be controlled 
in some fashion. 

A second problem, characteristic of almost all studies of women's economic 
role in marital instability, is the difficulty of establishing causal direction. This 
is one case where time order is not really a reliable indicator of causal ordering. 
Finding out that the likelihood of a separation in any given year increases when a 
woman's earnings or employment position were more favorable in the previous 
year does not necessarily support the independence hypothesis; this is because 
women who believe their marriage is in trouble may increase their work effort, 
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and hence their earnings, in anticipation of a breakup. A number of researchers 
have suggested this possibility, and at least two empirical analyses have found 
some evidence that this is the case (Johnson & Skinner 1986, Peterson 1989). 
This possibility of a reverse causal direction suggests that the general practice of 
limiting regression models to information on the behavior and characteristics of 
respondents in the immediate past may not provide enough data for an adequate 
evaluation of the independence hypothesis (Spitze 1988). In fact, studies of 
marriage formation are also generally weak in building in information on the 
individual's life course and tend to limit themselves to data on the very recent 
past. Furthermore, few studies follow young people long enough to capture 
life-cycle changes in explanatory variables or long enough for a relatively large 
proportion to have made the transition to marriage. Hence right-censoring may 
be a particular problem in this period of delayed marriage. 


THEORETICAL ISSUES 


Income Equality and the Gain To Marriage 


Although much used, “economic independence" is really arather vague concept 
whose ambiguities need to be more clearly recognized in assessing the effect 
of women's market work on marriage behavior. A relatively small amount of 
earnings may actually provide a married woman with the ability to act inde- 
pendently in the sense of making a variety of consumption decisions on her 
own and increasing her influence in joint consumption decisions. And many 
women could, with much less income than a prospective spouse, live at a level 
comparable to families with only the husband-father working. Whether this 
would also be true for mothers heading their own households is somewhat less 
clear and partly depends on how much the cost of childcare offsets the greater 
expense of a two-adult household. In any event, a lot of independence can be 
bought with earnings that are well below those of similar males who are sup- 
porting a family on their earnings alone. This, of course, has always provided 
the rationale for labor market discrimination against women. Multivariate anal- 
yses that use women's absolute earnings may implicitly be getting at this sort 
of independence. 

However, a recurrent theme in sociological discussions of the independence 
hypothesis is the notion that a woman's economic independence is defined in 
terms of income equality with her husband and that, furthermore, the gain to 
marriage is much less with economic independence defined in this manner 
(Cherlin 1979, Espenshade 1985, Farley 1988, McLanahan & Casper 1995, 
Sgrensen 1995). While this might be considered the logical conclusion of 
Becker’s theory, it is an approach that may often confuse rather than help clar- 
ify the nature of the social processes we are trying to study. Neither economic 
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independence nor low gains to marriage necessarily follow from income equal- 
ity; moreover, income equality may lead to low gains to certain marriages but 
not because economic independence has been achieved. Basically, the issue 
revolves around the question: What is economic independence, and what does 
it have to do with income equality and the gain to marriage? I briefly consider 
some of these issues below. 

Using 1940-1980 census data, Sørensen & McLanahan (1987) addressed the 
problem of how to measure the extent of married women’s dependency (on their 
husbands) and how this has changed over time. The measure they developed 
assumed that husbands and wives pool all their economic resources and share 
them equally; economic dependency is then defined as the difference between 
the husband’s and wife’s relative contribution to their combined income. If 
the wife’s income is equal to the husband’s then there is no dependence. As 
expected, given women’s rising employment, they found that their indicator of 
married women’s dependency decreased over time. However, it seems very 
likely that the measure does not just reflect degrees of dependency but also 
taps differences in the nature of the dependency involved. Two-earner families 
where the couple pool their resources are families whose level of living is based 
on their combined income. When the income contribution of each spouse is 
equal, given economies of scale, neither partner could live as well on his/her 
own or save and invest as much. Hence if, under the pooling assumption, each 
is dependent on their joint income, what is really being measured 1s how sym- 
metrical the economic dependency is. A wife (or husband) who has little or 
no income exhibits a very asymmetrical as well as a large income dependency 
and presumably must make other important types of contributions to the mar- 
riage for there to be a gain to both partners. This is the classical picture of the 
specialization and trading model of marriage. 

Couples with equal earnings also exhibit dependence, however—a symmet- 
rical dependency in this case. Although increasing symmetrical dependency 
should improve women's status and bargaining position in the family, marriage 
still involves important economic interdependencies, and there should still be 
a substantial economic gain to marriage for this reason alone. What this may 
also indicate is that the combined income of the two-earner family has come 
to form the social standard, rather than the husband's income alone. To the 
extent this is the case, it becomes increasingly difficult for single earners and 
married couples with a more traditional division of labor to achieve the same 
level of living as the two-earner family. Hence, the mutual dependence of the 
two-earner family may not only contribute to their own gain to marriage but 


6However, even 1f women achieved economic equality with men ın the labor market, this would 
not necessarily translate to 1ncome equality in a marriage. That depends on assortative mating 
patterns as well as family decisions on how much the wornan works over the family life cycle. 
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may also reduce the relative gain to being single and to marriages characterized 
by a specialized division of labor (Oppenheimer 1982). 

An unfortunate concomitant of defining economic independence in terms of 
earnings equality is that the significance of the absolute amount of earnings 
involved and of earnings adequacy almost drops out of the picture. Take, for 
example, the argument that 


the hypothesis about the decline in marriageable males is really an extension of the women’s 
independence argument, since women’s independence is a function of women's earning 
power relative to men's earning power Women’s independence can increase either because 
women’s earning power goes up faster than men’s or because it goes down more slowly than 
men's ... In principal, the independence argument can account for declines in marriage 
among men and women at all points 1n the income distribution (McLanahan & Casper 
1995, pp. 34-35). 


Following this logic, it is sometimes suggested that the main reason that 
African Americans are married in much smaller proportions than whites is 
not their generally poorer and more unstable economic position compared to 
whites. Rather, the relative earnings of African American women are much 
higher than those of white women (Farley 1988). Similarly, measuring in- 
dependence in the analysis of marital instability with earnings ratios alone, 
without also including some measure of absolute income, tends to equate earn- 
ings equality with women's economic independence, no matter how low the 
income of each partner is. Moreover, emphasizing income ratios per se ob- 
scures the underlying determinants of these ratios and discourages research 
into them and hence impedes the achievement of a greater understanding of the 
dynamics of social change and socioeconomic differentials. This emphasis also 
goes against the original rationale of the independence hypothesis, which was 
that women's increasing earning power increased their economic independence 
of men thereby reducing the gain to marriage (Becker 1981). It was not that 
the deteriorating labor-market position of men somehow increased women's 
economic independence, even if the women involved were themselves in an 
extremely weak labor-market position. If not dependent on their husbands or 
another partner, many women with very poor labor market prospects will still 
be partly or wholly financially dependent on other sources of income-—family 
support or welfare transfers, for example. This seems to be a situation where 


"This perspective also promotes such rhetorical questions as: “If the earnings of white women 
rise, compared with those of white men, will we find that white families increasingly resemble 
current black families? Two decades from now, will the majority of white children be born to 
unmarried women and raised in families headed by their mothers? Will 30% or 40% of white 
children live below the poverty line?" (Farley 1988, p. 491) This seems like an exceedingly 
strange outcome to posit for a rational choice model. 
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equality at a very low level reduces the gain to marriage but not because it 
signifies a woman's independence—certainly not in the sense of indicating 
economic self-sufficiency. In addition, the argument attributes far more power 
to women in the decision to marry than they actually possess. Many women 
in a poor labor-market position will not be very attractive marriage prospects 
themselves—either to men in a better labor-market position or to those in an 
equally poor position. 


Coping with Risk and Change 

Most theories, and perbaps economic theories most of all, make certain sim- 
plifying assumptions in order to facilitate the process of theory construction 
and empirical testing. These assumptions, however, can often provide the 
Achilles heel of a theory. In the case of the specialization model of marriage, 
the family's environment generally seems to be considered unvaryingly benign, 
and a family's needs and goals are assumed to be relatively constant over its 
developmental cycle. Questioning these assumptions, however, raises major 
doubts about some of the presumed "efficiency" of the specialization model 
of marriage. For example, although rarely considered in explications of the 
theory, historical research on the family indicates that extreme sex-role spe- 
cialization in marriage is essentially a high-risk and inflexible family strategy 
unless accompanied by supplementary support mechanisms. Even with such 
supports, specialization often entailed considerable individual and social costs. 
An inherent problem is that the temporary or permanent loss of one specialist 
in a family can mean that functions vital to the well-being of the complemen- 
tary specialist and children are not being performed. Husbands/fathers can die 
or become ill or disabled; they can lose their jobs and have difficulty finding 
another one; they could desert the family for a variety of reasons or become 
an alcoholic; and so on. The result is that the family is left without its ma- 
jor source of income. Except for employment-related shifts, there are similar 
problems involving the wife-mother specialist. In that case, there could be no 
one to take care of the children or the home. Specialization may be a feasible 
strategy in a large extended family household where no particular individual 
is indispensable because of the redundancy in personnel that can characterize 
such a system. However, for independent nuclear families and their individual 
members, specialization entails considerable risks. 

Extreme sex-role specialization is also not a very flexible way to deal with 
the varying needs of nuclear families over their developmental cycle. Since 
individuals' consumption needs and productive capabilities vary markedly by 
age, abasic feature of nuclear families is that theratio of consumers to producers, 
and hence the family's level of living, can vary substantially over the family's 
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developmental cycle (Berkner 1972, Oppenheimer 1982, Lee 1983). Hence, 
specialization involves a potentially serious inflexibility in dealing both with 
changes in a family’s internal composition and with the stresses posed by its 
environment. 

The large literature on family history in Western societies indicates that a 
variety of strategies were developed to maintain economic stability over the 
family’s developmental cycle and in the event of the temporary or permanent 
loss of specializing parents (Oppenheimer 1982, Ch. 9, Lee 1983). In the past, 
many of these strategies involved utilizing the productive labor of children, 
daughters as well as sons (Rowntree 1922, Anderson 1971, Haines 1979, Goldin 
1981, Tilly & Scott 1978). The evidence also indicates that utilizing the labor 
of one’s children could exact substantial costs. A sufficient number of children 
old enough to make an economic difference was generally not available until 
the middle or later stages of the family cycle. Families who temporarily or 
permanently lost the contribution of the father early in the family cycle were 
not greatly helped by such a strategy. And if the mother was lost when the 
children were young, the family might break up, with children being parcelled 
out among relatives or even going to orphanages. In general, this economic 
reliance on one's offspring often led to a pattern of "life cycle poverty" where 
periods of poverty and comparative plenty alternated over the life cycle of 
workers (Rowntree 1922). Another well-known disadvantage to the extensive 
employment of children to supplement their family's income was that it tended 
to discourage schooling and hence had a negative effect on children's adult 
socioeconomic status (Perlmann 1988, Goldin 1981, Parsons & Goldin 1989). 

Even aside from the drawbacks of using children's work, the economic advan- 
tages of this strategy were eventually bound to decline during industrialization. 
As the structure of demand shifted to a much more skilled labor force and 
adult male earnings correspondingly rose, the potential relative contribution 
of the unskilled labor of the family's children declined, particularly in middle 
class families. This suggests that other equilibrating mechanisms were likely 
to develop. Hence, from an historical perspective, the rise in married women's 
employment might be viewed as a functional substitute for the work of their 
children, facilitating the more extensive schooling of the next generation and 
thereby fostering intergenerational upward social mobility. In addition, wives' 
employment is not limited to the later stages of the family's developmental cy- 
cle, and wives, as adults, usually have educational attainments roughly similar 
to those of their husbands, and hence can command much higher wages than 
can their unskilled children. Moreover, recent research indicates that wives' 
employment currently plays an important role in offsetting a less favorable 
earnings position of their husbands (Cancian et al 1993, Levy 1996). 
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CONCLUSION 


Although the popularity of the women’s economic independence explanation 
of marriage behavior remains strong in the 1990s, this review of the literature 
found little real empirical support for the hypothesis. Micro-level event-history 
analyses that follow cohorts throughout their young adulthood generally show 
that women’s educational attainment, employment, and earnings either have 
little or no effect on marriage formation or, where they do have an effect, find 
it to be positive, the opposite effect of that hypothesized. The only support for 
the hypothesis from multivariate analyses is found in aggregate-level studies. 
However, aggregate-level analyses have serious drawbacks for investigating 
this micro-level hypothesis, not the least of which is establishing a convincing 
causal ordering. 

The juxtaposition of time series data of marriage behavior with women’s em- 
ployment appears to provide convincing evidence for the hypothesis, but upon 
closer examination this evidence becomes much less persuasive. In the case of 
marriage formation, a large part of the support for the hypothesis results from 
the confounding of delayed marriage with nonmarriage. The independence 
hypothesis is basically an argument about nonmarriage but, for white women at 
least, the major trend has been an increase in delayed marriage. The hypothesis 
could presumably be modified to say that the gain to marriage increases with 
age, but it is bound to lose a lot of its punch if, instead of making predictions 
about the gain to marriage in general, it is reduced to making predictions about 
the gain to marrying at age 22 versus age 20. But, in any event, that case has 
never even been made. e 

An additional major problem ıs that the apparently high correlation of the 
various time-series trends is largely a function of the starting point chosen. This 
is almost invariably sometime in the 1950s or early 1960s—i.e. the era that 
produced the baby-boom, early and universal marriage, lowered divorce rates, 
and so on. However, when one pushes the time series farther back, it is obvious 
that the divorce rate had been rising for many decades before married women’s 
employment started its rapid rise. And as far as marriage timing is concerned, 
the period before World War H exhibited long-term fluctuations in age at mar- 
riage and in its variability, all within the era when sex-role specialization in 
marriage was typical. Hence, the decision to use the benchmark of the 1950s 
and early 1960s was critical for how the linked time series have been evaluated. 

While it is widely recognized that the marriage behavior of the early postwar 
period is statistically atypical of American historical patterns, it has nevertheless 
achieved a moral stature that seems to justify its use as the model against 
which more recent family behavior is evaluated, often in a pejorative light. 
For example, the desirability of an early marriage is implicit when delayed 
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marriages are described as a “retreat” from marriage or a “decline” in marriage. 
However, during the period when age at marriage actually was early, there was 
no such universal approval of it,? and it is well known that marriages of the 
young are much more unstable than marriages formed at older ages. 

At the same time that the empirical support for the hypothesis is weak, its 
theoretical underpinnings are by no means immune from questioning. First, the 
notion of the efficiency of specialization and exchange as a basis of the marital 
relationship largely depends on certain assumptions regarding the stability and 
benevolence of the environment. However, real-world conditions indicate that 
specialization can be a risky and inflexible strategy for maintaining a family's 
economic well-being over time; achieving this has often been a result of having 
more than one earner in the household. What has changed is the identity of 
the additional earner(s)—now it is the wife, whereas historically it was more 
likely to be the family's children. Another major problem lies in the tendency 
to equate independence with equality of earnings. However, this approach fails 
to appreciate the economic gains to marriage where earnings are approximately 
equal, and the low gains that may result from a weak labor-market position, 
whether or not earnings are equal. Moreover, the underlying causes of low 
gains to marriage may be obscured when earnings ratios become the major 
focus rather than the conditions producing these ratios. 

In sum, this review of the economic independence argument provides little 
support for the extensive explanatory claims made for the hypothesis. The idea 
that specialization creates the gain to marriage, and hence that the desirability 
of marriage tends to disappear once women can earn a living wage, reflects a 
rather simplistic view of the basis of the marital relationship. Married women's 
employment undoubtedly has an effect on marriage, but we are more likely 
to understand that effect if a model of marriage is developed that is more 
multidimensional and flexible in its view of social roles. Certainly the fact that 
almost 7096 of married women now work and thus play an important part in 
their family's economic welfare suggests that a model of marriage needs to be 
developed to reflect observed behavior rather than the marital role behavior of 
the nineteenth century. 
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For example, the rationale that Preston & Richards provide for their 1975 article investigating 
the relationship of women’s employment and marriage timing is to understand factors that might 
raise the age at marriage because of the “deleterious effects of its [the population's] early marrying” 
at that time (p 209) 
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ABSTRACT 

Humans are inexorably driven to search for order and meaning in their own and 
others' lives; accounts are a major avenue for sociologists to depict and under- 
stand the ways in which individuals experience and identify with that meaning 
and their social world. The accounts concept has a solid foundation and history in 
early sociological analysis and research. The current work on accounts focuses on 
"story-like" interpretations or explanations and their functions and consequences 
to a social actor's life. The concept is useful for gaining insight into the human 
experience and arriving at meanings or culturally embedded normative expla- 
nations. This concept deserves greater explicit attention in sociology and is in 
need of further theoretical development and stimulation. I argue that sociologists 
should embrace the concept of accounts; the foundation is set for a resurgence of 
work on accounts in sociology. 


INTRODUCTION 


Recently in the social sciences, scholars have been encouraged to collect and 
interpret stories that people tell about their lives (Bochner 1994, Bruner 1986, 
1990, Coles 1989, Gergen & Gergen 1987, 1988, Harvey et al 1990a, Howard 
1991, Maines 1993, Mishler 1986, Orbuch et al 1993, Polkinghorne 1988, 
Sarbin 1986). Robert Coles (1989), in The Call of Stories, suggests that stories 
are everyone's rock-bottom capacity—a universal gift to be shared with others. 
Accounts and other related concepts, such as stories and narratives, represent 
ways in which people organize views of themselves, of others, and of their 
social world. 
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Sociologists first advanced the concept of accounts over two decades ago 
(Lyman & Scott 1970, Scott & Lyman 1968). According to Scott and Lyman, 
accounts are verbal statements made by one social actor to another to explain 
behaviors that are unanticipated or deviant. This early work was influenced by 
Goffman’s (1959) arguments about how people present themselves to others, 
often in a self-protective manner, and by Garfinkel’s (1956) analysis of the con- 
ditions necessary for successful status degradation. The concept also developed 
along parallel and sometimes overlapping lines with classical attribution theory 
and symbolic interactionism. 

The more recent work on accounts still recognizes these verbal and written 
statements as social explanations of events, but it places far less emphasis on the 
construction of accounts to justify unexpected or disrupted social interaction. 
Current scholars broaden the focus to consider “story-like” interpretations or 
explanations (Bruner 1990, Harvey et al 1990a, Maines 1993, McAdams et al 
1996, Surra et al 1995, Veroff et al 1993b). In many of these programs of 
research, respondents are asked to explain and interpret a stressful event (e.g. 
divorce, sexual assault) or the development of a relationship (e.g. courtship, 
then marriage). Recent theoretical developments also emphasize the process 
by which individuals develop an account (account-making), while continuing to 
focus on the specific contents of accounts, the major theme in the earlier work. 
For example, Harvey et al (1990a,b) focus on the motivations and reasons 
behind the presentation of accounts and the influence of others (audience) in 
the account-making process. 

Although early sociological analysis and research were important in defin- 
ing and conceptualizing the term account, the phenomenon has received only 
modest attention in mainstream sociology since the 1970s.! Whether under the 
heading of account-making, storytelling, or narration, the development of the- 
ory and empirical research has flourished largely in other disciplines (e.g. psy- 
chology, literature, communications, anthropology, history) and has increased 
rapidly in the last two decades. A major premise of this chapter is that the work 
on accounts has a prominent history in the discipline of sociology. This legacy, 
along with current theoretical and empirical work on accounts, can make a 
valuable contribution to contemporary sociology. 


1 After an zxtensive review of four mainstream sociology journals ( Social Psychology Quarterly, 
American Sociological Review, American Journal of Sociology, and Social Forces) from 1990 to 
1996, I found only four articles that focused on accounts, personal narratives, or story-telling: 
Connell (1992), Ellingson (1995), Griffin (1993), and Jacobs (1996). Other journals that are highly 
relevant to sociology and consistently publish articles on these topics include Journal of Narrative 
and Life History, Symbolic Interaction, and Journal of Contemporary Ethnography Social Science 
History also published a series of five articles on the subject of narrative and the formation of social 
identities (Sewell 1992) 
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THE EMERGENCE OF THE ACCOUNT CONCEPT 
IN SOCIOLOGY 


The writings of sociologists such as Garfinkel (1956, 1967), Goffman (1959, 
1971), and Scott & Lyman (1968, Lyman & Scott 1970) led the theoretical 
development on accounts. In general these scholars were interested in three 
questions: (a) What is the content of accounts? (b) Under what conditions do 
people present accounts?, and (c) Under what conditions do people "honor" 
or accept the accounts of others? The norms of a situation, the actor, and the 
audience all determine the content of an account and whether it is accepted in a 
given social situation. As to when individuals give accounts, sociologists held 
two distinct viewpoints; they regarded accounts (1) as justifications or excuses 
for socially undesirable or problematic occurrences, and (ii) as a means of 
weaving together disparate social events. 

Goffman's (1959, 1971) seminal work, The Presentation of Self in Everyday 
Life, exemplifies the conceptual foundation for contemporary research on how 
people tactically present themselves to others when social acts have potentially 
negative implications. When an individual commits an offense, a powerful 
script for account-making is set in motion: 


Regardless of the particular objective which the individual has in mind and of his motivation 
for having this objective, rt will be in his interests to control the conduct of others, especially 
their responsive treatment of him (Goffman 1971, p 3). 


This control is accomplished by providing accounts to nullify any negative 
implications flowing from appearance and demeanor. Goffman also argued that 
self-presentations or "performances" are socially constructed for the existing 
public audience to maintain social identities. 

Whereas Goffman was concerned with the use of accounts to counteract so- 
cially undesirable or problematic experiences, Garfinkel (1956, 1967) argued 
that accounts were a more general aspect of social life. Garfinkel acknowl- 
edges that such explanations or interpretations are more salient to others when 
the circumstances become problematic, but he also stresses that individuals or- 
ganize and manage their everyday affairs so as “to be accountable" to others for 
all actions and behaviors. Although Garfinkel considered accounts a regular 
day-to-day experience, he wrote most prominently (1956) during the McCarthy 
hearings ofthe 1950s about how people's self-worth may be degraded in societal 
or institutional rituals. 

Scott & Lyman (1968, Lyman & Scott 1970) were the most explicit in delin- 
eating account strategies for different social situations and in discussing how 
others might react differentially to the accounts. Like Goffman, they argued 
that accounts are presented to others to explain potentially problematic events 
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or behaviors. They defined an account as “a linguistic device employed when- 
ever an action is subjected to valuative inquiry” (1968, p. 46). Their writing 
focused on the notion that at the microsociological level, daily interactions and 
the use of talk are the significant mechanisms by which individuals neutralize 
negative acts or their consequences. They discussed two general types of ac- 
counts that contain socially approved vocabularies: justifications (whereby one 
accepts responsibility for the act in question, but denies the pejorative quality 
associated with it) and excuses (whereby one admits that the act in question is 
bad or wrong, but denies full responsibility). Both types are presented to others 
to protect self-esteem, social status, or social order. 

Many of the substantive concerns formulated by sociologists such as Scott 
& Lyman, Goffman, and Garfinkel have been extended or refined in other pro- 
grams of research. For example, Semin & Manstead (1983) refined the notion 
of accounts to examine specific practices, which they call collectively “the ac- 
countability of conduct" (p. x). These involve the repair of disrupted social 
interaction in a variety of social situations. Like Scott & Lyman, Semin & 
Manstead rely on talk as the medium for thess accounting practices. Also em- 
ploying the Scott & Lyman (1968) framework, Blumstein (1974) examined the 
conditions under which other social actors would or would not honor accounts. 
On the basis of experimental data, Blumstein found that the content of an ac- 
count predicted more strongly than the negative act itself whether individuals 
could neutralize negative actions and their consequences. Similarly, the work 
of communication studies scholars McLaughlin, Cody, and colleagues (1990, 
1992) builds directly on the ideas of Scott & Lyman (1968) and Blumstein 
(1974). Their research continues to elaborate on the various types of justifica- 
tions that people communicate to others following failures or social predica- 
ments and the consequences of specific justifications to the maintenance of 
social interaction and identities. 


CURRENT VIEWPOINTS ON ACCOUNTS 


The current literature on accounts differs in several ways from the literature 
in which the concept of accounts initially emerged. The early work on ac- 
counts was intrigued with the notion that individuals present impressions of 
themselves to others to maintain social interaction as well as personal status 
and esteem. This work was clearly oriented toward understanding deviance or 
disruptions in social interaction and the consequences of accounts for the nature 
and maintenance of that interaction. 

The current work on accounts and narratives is more social psychological in 
nature and focus; the functions and consequences of accounts to a social actor’s 
life have become more prominent (Bochner et al 1997b, Harvey et al 1990b, 
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McAdams et al 1996, Surra et al 1995, Weber et al 1987). In this work, accounts 
are viewed as story-like constructions that contain individuals’ recollections of 
events, including plot, story line, affect, and attributions. Individuals continu- 
ally update and reflect on these accounts, on the basis of feedback from others 
and the collective stories within which individuals reside (Harvey et al 1990a, 
Heath 1983, Maines 1993, Orbuch et al 1993, 1994). 

An example of the foregoing argument is provided by Harvey et al (1990); 
they contrast current viewpoints of accounts with that of an interesting study 
by McLaughlin et al (1983), concerning students’ retrospective reproaches for 
potentially blameworthy actions. McLaughlin et al (1983) describe a reproach 
as a projected account, meaning that the questioner offers it as a possible a priori 
justification to the person who must answer for some problematic behavior. An 
example of a reproach that was reported by a wife who was late to meet her 
husband is "My husband said to me ‘Where the hell have you been? Did you 
forget to wind your watch again?" Current scholars interested in story-like 
accounts (Harvey et al 1990b) would examine the reproach, but would also 
emphasize the rest of the story that likely exists in the relationship between the 
husband and the wife and the consequences and functions of that fuller story. 
He may, for instance, have developed a strong belief and accompanying story 
that his wife is engaging in extramartial relationships. This story may include 
denial, anger, sadness, and confusion along with reports of his efforts to cope 
with the situation given this meaning. On the other hand, she may believe that 
her husband is jealous and unreasonable in his expectations and inferences; she 
too may have stories to go with that theme which focus on why he is so jealous 
(perhaps going back to alienated relations with parents or to betrayal in a past 
relationship). 

Current theoretical viewpoints also emphasize that accounts are not merely 
social constructions to protect the self; they also (a) give individuals a greater 
sense of control and understanding of their environment, (5) allow individuals 
to cope with emotionally charged and stressful events, (c) produce some degree 
of closure, (d) provide a greater sense of hope and will for the future, and 
(e) establish order in daily relational experiences. McAdams et al (1996) state 
that "beginning in late adolescence and young adulthood, men and women 
living in modern societies seek to construct more-or-less integrative narratives 
of the self to provide their lives with a semblance of unity and purpose.” 

An explanation of the consequences of account-making for individuals was 
first inspired by Weiss's (1975) powerful analysis of individuals' accounts fol- 
lowing marital separation. In his book, Weiss proposed that newly separated 
individuals develop accounts to explain and understand what happened and 
why. More importantly, these accounts were psychologically effective in deal- 
ing with the stress and distress accompanying separation. The development of 
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the account was vital to the chain of events in which a person achieved closure 
about the loss of a relationship and motivation to move on in life (Weiss 1975). 

The current work on accounts also acknowledges that accounts may reflect 
culturally embedded normative explanations. Although the original content of 
accounts, including reports of plans and intentions, style of presentation, and 
context of presentation ts of primary importance to most scholars working with 
accounts, these explanations are seen as serving as much a communicative- 
persuasive mission as they do a function of explanation and self-insight. For 
example, Harre et al (1985) argue that in producing their accounts, actors are 
displaying knowledge of the ideal ways of acting and ideal reasons for doing 
what they have done. These investigators collect accounts from various popu- 
lations or cultures, such as football fans, and then search through the accounts 
for descriptions of what should happen. Thev then represent this material as 
a system of rules representing a system of knowledge or beliefs. A similar 
position is adopted by Shotter (1984) who focuses on how people make their 
behavior accountable in a moral world. He stresses how people must talk about 
themselves in terms of the social order and socially constructed morality in 
which they exist. People must account for their experiences in ways that are 
intelligible and legitimate in their current social context. 

In contrast to the early literature on accounts, the current literature also em- 
phasizes the value of communicating accounts to others as a mechanism for 
coping with major life events (Harber & Pennebaker 1992, Harvey et al 1990a, 
1992, Pennebaker & Harber 1993, Silver et al 1983, Weiss 1975). Failure to 
engage in this account-making process can lead to chronic problems, including 
psychosomatic illness (Orbuch et al 1994). In his book, Opening Up, James 
Pennebaker (1990) documents many of the negative psychological and physical 
effects (e.g. long-term physical problems such as high blood pressure, depres- 
sion, prolonged grief) associated with not confiding in others about traumatic 
experiences and losses. More recently, research has examined the activities 
of account-making and confiding across a wide array of stressful events (e.g. 
transition to a nursing home, sexual assault, Vietnam War). Evidence suggests 
that the interaction/social process of confiding accounts to others has positive 
consequences for individuals’ psychological and physical well-being (Harvey 
et al 1991, Harvey et al 1995, Lichtman & Taylor 1986, Orbuch et al 1994, 
Pennebaker 1990, Weber et al 1987). 

This broad and more recent conceptualization of accounts has also been ap- 
plied extensively to research and theory in interpersonal relationships (Berscheid 
1994, Burnett & McGhee 1987, Fincham 1992, Harvey et al 1990a, 1992, 
Matthews 1986, Murray & Holmes 1994, Orbuch et al 1993, Surra et al 1995, 
Veroff et al 1993a,b). Accounts of close relationships contain rich and detailed 
information about various sources of relational and self knowledge, such as the 
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nature and dynamics of a present relationship and preexisting normative beliefs 
and rules about what makes a relationship and how to behave appropriately in 
that relationship (Surra & Bohman 1991, Surra et al 1995). 

It is likely that the concept of accounts was broadened for both conceptual 
and methodological reasons. The accounts-as-stories framework is better able 
than standardized survey instruments to represent the rich, complex, interwoven 
reports that investigators are currently collecting, often with populations who 
are facing major life stressors. Accounts may also give a fuller understanding 
than a checkmark on a questionnaire regarding the ambivalences, uncertainties, 
and angsts that are a day-to-day reality. “If we wish to learn the realities of 
personal relationships, we would do well to pay attention to such natural forms 
these realities take on" (Weber 1992). It also makes sense to complement, 
rather than supplant, the structured questionnaire with a story format because 
respondents may be comfortable in “telling their stories” or describing the 
meaning they make of their experiences as naturally unfolding events, rather 
than as structured responses (Baumeister & Stillwell 1992). 

Veroff et al (1993a) argue that “direct questions can elicit historical truth 
to some extent, but they often engage much more of the person’s social self- 
presentation and hence are primarily useful in understanding situations where 
self-presentations are particularly salient” (p. 439). Further, direct questions 
often fail to examine how individuals perceive their experiences to be related 
to social, cultural, and personal circumstances (Mishler 1986). In contrast, 
accounts are more likely to reveal nonconscious motives and meanings and to 
illuminate individuals’ interpretations in a social, cultural, and personal context. 

Weber (1992) gives a poignant example of the importance of personal context 
in the following account of a 67-year old woman describing her painful divorce 
and its aftermath. 


I can honestly say that today I am a resourceful person, a warm and caring person, without 
any bitterness. I am retired, living alone 1n my little house and very involved with family 
and a few friends. Oddly, I am not mto church acnvities. I do help individuals when I can 
... but I can’t say I'm a “do-gooder.” 


I am a happy person, truly, at this stage of my life. I do remember good things about our 
marriage—-some of the funny things, the sweet, thoughtful gestures J also remember the 
very bad times, but I don't dwell on them, I remember being carried over the threshold of 
each new apartment or house with each new move I remember a Christmas stocking tied 
to the foot of my hospital bed after the birth of our first child on Christmas eve My feelings 
now are, weren't we lucky to have had so much once. Perhaps now at 67 I'm remembermg 
more often (Weber 1992, p. 186) 


The self-descriptions of herself (warm, caring, self-confident, resourceful) and 
the current meaning of her divorce (e.g. she does not want to dwell on the bad 
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times, weren’t they lucky to have so much cnce) are better understood only 
after we consider the context of the larger story of her loss. 

These written or verbal accounts can be left in the form of a story and re- 
ported as such. Alternatively, the material can be coded and treated within 
quantifiable categories (Antaki 1988, Fletcher 1983, Harvey et al 1988, Or- 
buch et al 1993, Veroff et al 1993b). Berscheid (1994) suggests that this latter 
task can be interpretive and must be planned and executed carefully. Mish- 
ler (1986), Riessman (1993), Veroff et al (1993b), Surra (1988), and Weiss 
(1994) have written extensively on various stages of the gathering, coding, 
and analysis of accounts. Nonetheless, future research will require greater 
emphasis on and discussion of strategies for coding and analyzing accounts. 
The coding and analyzing of accounts can be an arduous and time-consuming 
task. 


RELATED THEORETICAL CONCEPTS 
AND DEVELOPMENTS 


A number of related theoretical concepts and developments have been instru- 
mental in expanding current knowledge of accounts and their utility in soci- 
ological analysis. A concern with subjective interpretations of social reality 
and the construction of meaning through social interaction has a long, varied 
history in sociology. Even the early ethnographic research (Cressey 1932, Park 
& Burgess 1921) from the Chicago School of sociology was motivated by ques- 
tions about process and interaction that are dominant themes in much of the 
Work on accounts and narratives (Abbott 1992). 


Communicated Language 


The foundations of accounts are similar to the classic concepts of “vocabularies 
of motive" postulated by C Wright Mills (1940) and the "grammar of motives" 
proposed by Burke (1945). Both Mills and Burke examined language as a 
vehicle by which individuals communicate explanations of their actions. Burke 
(1945) was concerned with the grammar that is presented and its persuasive 
tendency rather than with the historical reality of the action and what motivated 
it. Burke's logic is similar to that emphasized in current frameworks of accounts; 
the focus shifts from individuals' reasons for events or actions, to account- 
making as a persuasive endeavor designed to convince self and other of the 
credence of one's reality. In an experimental study, Orbuch et al (1992) found 
that respondents assigned trait inferences to others (e.g. respectable, selfish, 
mature, trustworthy) and formed impressions of others as people (likability) 
based on their presented accounts. Respondents were also persuaded to interact 
or not interact with others based on their presented accounts. 
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In contrast to Burke and in a way similar to Goffman (1959) and Mead 
(1934), Mills (1963) emphasized that motives originate not within the individ- 
ual but from the social setting and audience at hand. “Motives are accepted 
justifications for present, future or past programs or acts” (p. 443). Appropriate 
motives and vocabularies also differ with culture or group context. According to 
Stokes & Hewitt (1976), Mills's “motive talk " and Scott & Lyman’s "accounts" 
are significant constructs because they provide a link between culture (collective 
understandings) and individual behavior. Accounts and motive talk can be seen 
as "aligning actions;" in the face of negative behavior, individuals align their 
behavior with culturally acceptable language to restore order and interaction. 


Deviance 


The vast literature on the sociology of delinquency or deviance is also germane 
to much of the early work on accounts as explanations for potentially blame- 
worthy actions (Sutherland 1955, Sykes & Matza 1957). A main theme in this 
literature is that individuals give justifications or rationalizations for their de- 
viant behavior to deflect blame and protect self-esteem. Sykes & Matza (1957) 
report four techniques for neutralizing self-blame: (a) denial of responsibil- 
ity, (b) denial of injury, (c) denial of the victim, and (d) condemnation of the 
condemners. These techniques of neutralization follow deviant behavior and 
protect the individual from self-blame and the blame of others after the act. 

Many of these theoretical ideas run parallel to the concerns expressed in the 
early work on accounts, especially the writings of Scott & Lyman (1968). Like 
the scholars who were trying to explain the causes of deviant behavior, according 
to Scott & Lyman, accounts are presented in response to the violation of social 
norms and are "voiced" after a behavior to avoid detrimental consequences to 
self. 


Attributions 

The early work on accounts developed in close association with theory and 
research on attributional processes (Crittenden 1983, Felson 1981, Harvey et al 
1978, Heider 1958, Jones & Davis 1965, Weary et al 1989). Classical attri- 
bution theory focuses on the process by which individuals arrive at subjective 
explanations for actions and events. These explanations allow social actors to 
integrate and explain the events and occurrences around them. Fritz Heider's 
(1958) writing is significant for much of the past and present work on accounts. 
Heider viewed people as possessing, using, and often articulating a common- 
sense psychology of their understanding of interpersonal and other events. For 
example, people often explain their achievement of a task as a function of their 
own ability (internal cause), their own efforts (internal cause), conditions asso- 
ciated with the performance situation (external cause), or luck (external cause). 
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In the late 1970s, many attribution theorists began to broaden their focus 
and to formulate account-like concepts (Crittenden 1983, Felson 1981, Harvey 
et al 1978, Wiley & Eskilson 1981). Some years passed, however, before 
scholars (Harvey et al 1986, 1990a) began to appreciate more fully the value 
of examining attributions within the context of natural stories that people tell 
about their lives and linked attributions explicitly to the earlier work by Lyman 
& Scott (1970) and to Weiss’s (1975) account work on marital separation. These 
scholars were also influenced by a number of British social psychologists who 
had already embraced the concept of accounts and were analyzing the stages 
of account-making and the strategies by which individuals choose to perform 
or not perform specific accounts (Antaki 1987, Harre 1977, Schonbach 1980, 
1990, Shotter 1987). 

Currently, accounts are viewed as more than collections of disparate attribu- 
tions. Accounts are packages of attributions (including attributions of causality, 
responsibility, and blame, and trait ascriptions both to other and to self), tied 
together by descriptive and emotive material (Harvey et al 1992). Attributions 
concentrate more strongly on the cognitive aspects of judgments and respon- 
sibility than on the social processing of these statements; accounts emphasize 
the cognitive, emotional, and social aspects of these explanations. Further, 
accounts contain much more elaborate and more complex general information 
than do attributions, including but not confined to statements about affect-laden 
memories. 

Berscheid (1994) states that “the current use of account narratives appears 
to derive from two sometimes incompatible perspectives: the information- 
processing and the ethnographic perspectives" (pg. 99).? She maintains that 
the former tradition represents theory and research focused solely on social 
cognition (attribution theory). In contrast, the ethnographic approach focuses 
on the construction of meaning to make sense of one’s self and social world 
(Bruner 1990, Gergen & Gergen 1988). This distinction between meaning- 
making and information-processing is not clear and may be an unwarranted 
characterization of the literature on accounts. Certainly the area of social cog- 
nition (including classical attribution work in social psychology) has influenced 
the formulation of accounts as narratives, but the major part of recent work on 
accounts (Harvey et al 1990a,b, McAdams et al 1996, Orbuch et al 1993, Surra 
et al 1995) embodies principles from both perspectives. The current viewpoint 
on accounts as story-like constructions describes them as cognitive processes by 
which individuals create and organize meaning from their social world (account- 
making). 


2Berscheid labels accounts that are story-like ın nature as account narratives. This 1s contrasted 
with the more traditional use of the word “account” (e g. Scott & Lyman 1968) 


SOCIOLOGY OF ACCOUNTS 465 


Symbolic Interactionism 


As implied in the previous discussion of Goffman and Garfinkel, the work on 
accounts also has intellectual ties to symbolic interactionism (Blumer 1969, 
Cooley 1902, Fine 1993, Mead 1934, Stryker 1981, 1983). A common goal of 
these frameworks is to understand the role of subjective experience to human 
behavior, in the context of personal and social interaction. Further, the link be- 
tween individual and society through the construction of meaning is a powerful 
theme in both perspectives. 

In 1981, Stryker & Gottlieb wrote a chapter that compared symbolic interac- 
tionism and attribution theory. They argued that both theories share an interest 
in individuals’ subjective interpretations of their social world. However, sym- 
bolic interactionism is the broader of the two frameworks, because it addresses 
the interactional bases of such subjective interpretations and the sociocultural 
aspects of social behavior (Stryker & Gottlieb 1981). This distinction is legiti- 
mate but does not generalize to a comparison of accounts theory and symbolic 
interactionism. Recent work on accounts examines the social interactional pro- 
cesses by which individuals develop accounts and their meaning in the context 
of individuals’ personal, social, and cultural histories. 

Concepts such as self, identity, taking the role of the other, self-presentation, 
and generalized other, which were central to the early development and writings 
of symbolic interactionism (Blumer 1969, Cooley 1902, Mead 1934), are salient 
processes and topics 1n the work on accounts and narratives. Many argue that 
identity is generated or maintained through the process of storytelling (Crites 
1986, McAdams et al 1996, Polkinghorne 1988, Somers 1994, Surra et al 1995). 
LaRossa (1995) theorizes that although stories may be recordings of the past, 
they also function to create and shape the definitions that individuals create 
of themselves and of others. Hopper (1993) found that individuals’ accounts 
of divorce were not motivated by the actual events, feelings, and intentions of 
their experiences, but instead were driven by the individuals' identities in the 
divorce process (initiator, noninitiator). 

Currently, although symbolic interactionism includes many different ap- 
proaches (Burke & Reitzes 1991, Couch 1992, Denzin 1985, Simon 1995, 
Thoits 1991), the perspective as a whole has made some progress in bridging 
the gap between macro and micro sociologies (Fine 1992). As an example, cur- 
rent frameworks incorporate the influence of structured roles and identities as 
constraints in interpersonal interaction and in the process of meaning construc- 
tion (Stryker & Stratham 1985).? This elaboration has led to fragmentation but 


3Stryker and his colleagues (1983, Stryker & Stratham 1985) argue that a structural symbolic 
interachonism 1s more appropriate for sociological social psychologists interested in how interac- 
tions are constructed and become meaningful according to the roles and structured 1dentities that 
we possess 
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also to a general acceptance and integration of symbolic interactionist doctrines 
into mainstream sociology (Fine 1993). Accounts scholars may want to pay 
attention to the efforts of symbolic interactionists; a future goal for scholars 
studying accounts is to increase their efforts at synthesizing and integrating 
macrosociological concerns into their frameworks and analyses. 


Narratives 


At present, there are numerous approaches to the study of narratives; the lit- 
erature cuts across various theoretical and disciplinary boundaries. Riessman 
(1993) states that “the study of narrative does not fit neatly within the bound- 
aries of any single scholarly field” (p. 1). Further, while the concept of accounts 
can be linked to developments in mainstream sociology and social psychology, 
narrative analysis was largely inspired by the “crisis of confidence” in social 
science promoted by poststructuralism, postmodernism, and deconstruction 
(Bochner et al 1997a). Among social science historians, the use of narratives 
“marks a global change in the relationship of the social sciences to the human- 
ities” (Sewell 1992, p. 480). “Over the past two decades, historically minded 
social scientists have gone from a general disdain for the humanities toward 
curiosity about, respect for and increasing intellectual collaboration with them. 
Narrative is one of the emerging points of intersection” (p. 481). 

In general, the narrative perspective moves away from describing language as 
a means to discover or mirror reality to the view that language and meaning are 
an ongoing and constitutive part of reality. Narratives play a “dual role"; they 
represent or reconstruct the past and they are a “sort of ‘independent variable’ 
active in shaping the past" (Hart 1992, p. 634). For narrative scholars, there is 
no "real" event, such as a relationship break-up, whose objective nature exists 
and can be learned or inferred through various sources. Instead, narratives are 
"real" events as presented, and narrative analysis pays special attention to the 
form, coherence, and structure of these stories (Steinmetz 1992). 

Theory and research on accounts and on narratives are quite similar, even 
though these two concepts have been analyzed in quite different ways. First, 
both concepts emphasize that these story-like conceptions are first-person in- 
terpretations or explanations. Systematic research within each framework has 
focused on rhe process by which individuals construct these stories and on the 
recognition that stories formulate, control, and represent self, other, and rela- 
tionships (Harvey et al 1990a, LaRossa 1995, Orbuch et al 1994, Surra et al 
1997, Sternberg 1995, Veroff et al 1993b,c). “People construct identities (how- 
ever multiple and changing) by locating themselves or being located within 
a repertoire of emplotted stories" (Somers 1994, p. 614). Sternberg (1995) 
claims that people organize their views of relationships in story schemas; future 
relationships are then shaped and controlled through the process of storytelling. 
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Both concepts also highlight the idea that these subjective explanations de- 
velop and evolve in the context of cultural and social factors. Whether the story 
is constructed to fit the social audience or whether it is constructed privately 
and influenced by an imagined audience or cultural script (Harvey et al 1990a), 
the social nature of these stories is explicit and dominant in writings on both 
narratives and accounts. Further, individuals' positions or statuses (e.g. class, 
gender, race) socially influence the appropriate stories to tell and the reasons 
for telling them (Barbre et al 1989, Veroff et al 1993a). Rosenwald & Ochberg 
(1992) stress the importance of cultural discourse in understanding the narra- 
tives that individuals present; cultural conditions may constrain individuals' 
behaviors as well as the language and stories they produce. 

Finally, there is widespread consensus about the necessary components of 
both narratives and accounts. Despite some variation, narrative scholars argue 
that the retelling of events follows a temporal sequence and that plot, character, 
theme, and emotion are evident in these stories. The temporal sequence of 
events is also addressed in the work on accounts. À dominant theme throughout 
both literatures is that narrative order allows an individual to make sense of 
events, situations, or identities. For example, in a study of the communication 
dynamics found in epilepsy self-help groups, Arnston & Droge (1987) suggest 
that narrative order (À happened, then B) makes the events portrayed seem 
more understandable and predictable. From the standpoint of the other social 
actor, the story is more likely to be comprehended and accepted if a sequential 
plot exists and if order is evident. 

Although the two perspectives have much overlap in concerns and basic 
assumptions, there are issues that differentiate the two approaches. An account 
and a narrative differ: The standard meaning of narrative is that of an individual 
telling his or her story, usually orally (Gergen & Gergen 1983); account-making 
includes both the telling of a story in public and those activities that occur in 
private such as diary keeping, rumination, and other intrapersonal processes. 
These private activities are defined as social acts in light of Mead's (1934) view 
of the imagined reactions of others and their influence on all actions (Harvey 
et al 1992), but they are not necessarily literally told to others. Antaki (1987) 
distinguishes between performable accounts (those which we present to others) 
and unperformable accounts (those which we do not present to others perhaps 
because of secrets or embarrassing information). In contrast, narratives are 
public and involve overt social behavior or an audience. 

Another difference between these two story-like concepts is the degree to 
which each develops in response to a stressful or imposing event. Current views 
of accounts embrace a wider array of situations than in the past (positive and 
negative events); however, a major assumption is that people tend to engage in 
account-making when they are troubled by imposing problems or stressful life 
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events (Harvey et al 1990b, Huston et al 1981, Orbuch et al 1993). A stressful 
or difficult situation is not required for the telling of a narrative, especially in the 
context of a life narrative. Further, since individuals may be more likely to assess 
and explain causality in the face of a stressful event, attributions become an 
important component of the conceptualization of accounts. In contrast, narrative 
perspectives do not emphasize attributional statements. 


Summary 

In general, the fundamental questions in the literature on communicated lan- 
guage and cn deviance are strikingly similar to those in the early accounts work. 
Individuals present impressions of themselves to others in the face of poten- 
tially blameworthy actions to maintain social interaction and personal status 
and esteem. An analysis of attribution theory, symbolic interactionism, and 
more current viewpoints of accounts also yields a shared premise—“‘that indi- 
vidual behevior is guided by the subjective contruction of reality rather than 
by objective properties of external stimuli” (Strker & Gottlieb 1981, p. 427). 
It is important to note however, that underlying differences also emerge in this 
comparison. Itis also the case that the terms “narrative” and “account as a story- 
like construction” are often used to signify the same phenomenon or process. 
Although substantively unique, the distinct meanings of these two concepts are 
often difficult to disentangle. 


ACCOUNTS AND THE GOALS OF SOCIOLOGY 


Accounts as story-like conceptions have much to offer and contribute to soci- 
ological incuiry. For sociologists, especially sociological social psychologists, 
accounts can be an important means and product of inquiry because these sto- 
ries treat the human being and his/her mind as invaluable to understanding and 
explaining social behavior. 

Ewick & Silbey (1995) discuss three mechanisms by which narratives enter 
scholarly research: “Narrative can be the object of inquiry, the method of inquiry 
or the product of inquiry” (p. 201). I employ this classification as a heuristic 
device to illustrate the potential value of accounts for sociological inquiry. 
Further, challenging questions for future sociological work are addressed within 
each section. 


The Object of Inquiry 

A vast literature examines accounts as the object of inquiry. This work focuses 
on the nature of accounts (Fincham 1992, Harvey et al 1992, Mishler 1995, 
Schonbach 1990), how they are formed (McLaughlin et al 1990), and their 
function in individuals’ lives (Gergen & Gergen 1987, Harvey et al 1990a, 
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Weber et al 1987). The study of accounts as the object of inquiry also allows 
sociologists to observe how individuals impose order and subjective meaning 
on the experiences around them (Riessman 1993). Sociologists, especially 
social psychologists, have always been interested in the processes by which 
individuals establish balance in their interactions with others. Whether that 
balance is necessary as a result of “disrupted or fractured” social interaction 
(Goffman; Scott & Lyman), because of a stressful event (Harvey et al 1990a, 
Orbuch et al 1993, Pennebaker & Harber 1993), or as part of everyday living, 
the process of imposing order by developing an account conveys important 
information about this process to sociologists. 

It has sometimes been argued that accounts are reported by people only when 
investigators ask them explicitly to do so. Others contend that individuals de- 
velop accounts for anything that is called on by self or other to explain, whether 
that be researcher driven or not (Harvey et al 1990a,b). The former argument 
has also been strongly challenged in work on spontaneous or unsolicited inter- 
pretation (Fletcher 1983, Harvey et al 1988, Holtzworth-Munroe & Jacobson, 
1985). This empirical work suggests that it is possible to code people’s free 
response reactions to events in such a way as to reveal meaning-making and 
various types of attributions. Nonetheless, whether individuals develop ac- 
counts (private or public accounts) without being prompted is a much debated 
topic that needs to be further examined. 

One of the most challenging issues for future work on accounts as objects of 
inquiry, yet central to all sociological inquiry, is a detailed focus on the context 
in which individuals make and present accounts. Are there certain periods of 
life or stages of the life course (e.g. major transitions such as mid-life crisis, 
change in profession, retirement) when account-making is intensified or when 
a fuller account is presented to others (perhaps because individuals received 
feedback and refined the account over time). A related topic that needs further 
discussion is the importance of status (e.g. gender, ethnicity) or personality 
(e.g. extroversion, empathy) characteristics in understanding the narratives 
that individuals present. 

Relevant to this later concern is work on social memory which suggests that 
women may be more likely than their male significant others to take on the role 
of "relationship historian" and to maintain a continuing effort to observe, docu- 
ment, and analyze major relationship events (Holtzworth-Munroe & Jacobson 
1985, Ross 1988). Further, Veroff, Orbuch, and their colleagues (Veroff et al 
1993a,b,c, Orbuch et al 1993) have examined the relationship stories of black 
and white newlyweds in a longitudinal study of marriage. Results from the 
study suggest that by and large the couple narratives are quite similar in both 
groups, but important differences do emerge. It is their contention that these 
differences yield especially meaningful information about how race/ethnicity 
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influences the means and product of storytelling. I select two examples from 
this study for illustration. First, among black couples, there was greater mention 
of affects for which the husband was the source than in white couples, whether 
the affect was mentioned by the wife or the husband (e.g. “My husband didn’t 
want to have a large wedding”). Veroff et al (1993a) state that these findings 
may suggest that husbands have more influence in forming the meaning of mar- 
riage for black couples than they do for white couples. Second, in a study by 
Orbuch et al (1993), white husbands and wives were significantly less dramatic 
in storytelling than black husbands and wives. These results support Heath’s 
work (19835, which indicates that blacks may have more experience with the 
storytelling discourse. 

There are also only a few clues available in the literature on account-making 
about the various themes per types of events that people may emphasize (Harvey 
et al 1990b, Veroff et al 1993a,b). A categorization of various themes would 
be extremely useful when coding and analyzing individuals’ accounts, and a 
lack of such is one of the major weaknesses of the current accounts literature. 
While justification and exoneration of self are common themes in the marital 
separation literature, these themes may not resonate with accounts of satisfying 
relationships or illnesses that are life-threatening. Gergen & Gergen (1987, 
1988) have also discussed the ways that account themes differ with social 
context. They contend that accounts change over time and presentation; the 
presentation of an account is analogous to other social behaviors; it is influenced 
by the current circumstances, audience, and social setting. 

Thus, another important direction still largely ignored in the literature is the 
persuasive nature of these constructions. I have discussed the self-presenta- 
tional motivation for account-making; however, the accounts literature gives 
us little insight into the dynamics of the influence process. Many argue that 
stories are not based on historical truth but instead are used to manipulate 
or persuade some preferred self-presentation to others. LaRossa (1995) sug- 
gests that "more attention should be paid to how people socially create stories 
about what really happened back then, to suit their own political interests" 
(p. 553). The reasons for these myths and manipulations, rather than the ac- 
counts themselves, have promise for investigation of the persuasion processes 
as they are influenced by account-making. The audience and cultural under- 
standings fcr our account-making activities are important in this persuasion 
process. 


The Means of Inquiry 


Sociologists also can use respondents’ accounts as a means of assessing (or 
illuminating) other aspects of the social world (Ewick & Silbey 1995). An 
emerging literature demonstrates how widely accounts are used to study other 
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social processes or phenomena that interest sociologists. Respondents' explana- 
tions in story-like form have been found to reveal distinctive information about 
interpersonal relationships (Huston et al 1981, Orbuch et al 1993, Surra et al 
1995, Veroff et al 1993b), adjustment and coping with stress or loss (Ellis 1995, 
Harvey 1996, Harvey et al 1990b, Pennebaker 1989, 1990, Silver et al 1983, 
Vaughn 1986), the construction of gender (Barbre et al 1989, Etter-Lewis 1993, 
Stanley 1993), and the development or generativity of the self (Baumeister 
1986, 1989, McAdams et al 1996). 

The following excerpt is from an account that was provided by a 47-year old 
male who experienced sexual assault between the ages of 3 and 12 by his father, 
cousin, and men who worked at his family home. It provides insight into how 
accounts can be used as a means to understand or assess the severity of sexual 
assault and its aftermath. 


It has affected every portion of my external and internal life for the last 44 years Close 
relationships? I don't have any, and never have had . And I've been married for 25 years 
now! But it is not a close, personal or intimate relationship. If anything, it is abusive to 
me. I don't trust anyone. " (Orbuch et al 1994, p. 260) 


Further, we do not, as yet, have evidence about how the more subtle and 
complete types of accounts that probably predominate in most people's lives 
are related to behavioral expectations or actual lines of behavior (Harvey et al 
1990b). For example, a type of explanation for a past break-up that emphasized 
a too rapid movement to intimacy might be related to future expectations of 
“going slower" in developing the next relationship or “getting to know someone" 
before getting involved in another relationship. 

As significant is the challenge of trying to establish causality in the link 
between accounts and social psychological phenomena (e.g. social behavior, 
identity formation, adjustment to stressor). Most of the research conducted 
to date has been nonexperimental in design and cross-sectional in nature. It 
is crucial, however, to explore whether the relationship between accounts and 
social psychological phenomena holds up longitudinally and across different 
domains of social life. 

As part of a longitudinal study of the early years of marriage (Veroff et al 
1995), joint accounts by couples were “used to understand the meaning that 
newlyweds in modern urban society make of their relationship and how this 
emerging meaning may relate to present and future well-being” (Orbuch et al 
1993, p. 815). These joint accounts provided researchers with new information 
about marital processes, beyond anything that can be assessed with standard 
survey measures (Veroff et al 1993c). A good illustration of this contribution 
is provided in an analysis of collaboration in these joint accounts. For all but 
the black husbands, the fact that the couple had some conflict in telling their 
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courtship story in Year 1 (of their marriage) was indicative of less marital hap- 
piness in Year 3 than if the couple did not have this kind of conflict. Apparently, 
not being able to agree on the story of the relationship has long-term implica- 
tions for the well-being of the marriage. These findings support a causal link 
between collaboration in couples' joint accounts and future marital well-being. 

A number of sociologists have utilized storytelling methods or accounts- 
like concepts in their research (Connell 1995, Hochschild 1989, Leidner 1993, 
Martin 1997, Stacey 1990), but they do not refer to or think of these concepts 
as accounts. I argue that in these programs of work, story-like constructions 
are used to illuminate the processes by which individuals impose order to their 
lives. For instance, in the book The Second Shift, Hochschild (1989) discusses 
what could be called the ‘Holts’ account" of their division of home labor based 
on the upstairs/downstairs division. This "sharing" arrangement functions to 
make their marriage work and allows each of them to hold onto their gender 
ideologies. Hochschild conceptualizes this arrangement as a gender strategy or 
a family myth, but it could also be conceptualized and analyzed as an account, 
if greater attention were paid to the temporal sequencing of events, attributional 
statements, and the affective qualities of the story. 

Further, if we assume that people learn accounts primarily in the same ways 
as they learn other social behaviors—from parents, peers, and other institu- 
tions such as the media and religion—then the examination of accounts and the 
process of account-making can contribute to theory and the understanding of so- 
cialization, norms, conformity, and other interactive learning processes. Future 
accounts work might also explore which institutions or agents of socialization 
are most important for the development of accounts and their presentation to 
others and why some stories are kept private, while others are confided in public. 


The Product of Inquiry 


Ewick & Silbey (1995) believe that narratives enter scholarship when social 
researchers themselves function as storytellers in producing accounts of social 
life. Accounts can enter sociology by similar means. This form of represen- 
tation can include a researcher's description of the data, the interpretation of 
the results, or the telling of his or her own account of a personal experience 
(Bochner et al 1997b, Ellis 1995, Stanley 1993). In addition, when qualitative 
researchers such as Stacey (1990) and Hochschild (1989) describe their own 
role in the research process and in the gathering of data, then the researcher 
also becomes a part of the storytelling process. 


The Use of Accounts in Sociology 


The discipline of sociology has a history of theory and research focused on 
accounts as justifications of events, beginning with the early work of Scott 
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& Lyman, Goffman, and Garfinkel. Despite this prominent history and the 
“current enthusiasm for viewing storytelling as a way to make sense out of 
personal or relational experience” (Veroff et al 1993b, p. 440), few sociologists 
have explicitly used accounts as sources of data in their present empirical work 
and analysis. 

For several alternative but complementary reasons, a greater number of so- 
ciologists have not been much involved in recent research and theory on ac- 
counts. Some argue that sociology is embedded in theoretical paradigms and 
methodologies characterized by a positivist and empiricist orientation, which 
thus prevent the acceptance and use of narratives or accounts (Maines 1993). 
Others (Abbott 1992) maintain that current sociological analyses ignore a more 
general sense of process or story because sociology emphasizes causality and 
the principles of logical positivism. According to Abbott (1992), “Our normal 
methods parse social reality into fixed entities with variable qualities. They 
attribute causality to the variables—hypostatized social characteristics—Trather 
than to agents; variables do things, not social actors. Stories disappear” (p. 428). 

Bruner (1986) distinguishes between narrative and logico-scientific modes 
of cognitive reasoning. These two means of constructing reality are quite 
distinct from one another in terms of ordering social experience, principles of 
thought, procedure, and verification. It is argued that sociology and other social 
sciences generally have promoted and supported the logico-scientific mode and 
have suppressed the narrative mode in mainstream publication outlets. (For 
a broader discussion of the two modes of reasoning and this viewpoint, see 
Bochner et al 1997b, Maines 1993, Richardson 1990). 

In addition, the accounts concept has been tied to the foundations of sym- 
bolic interactionism, a perspective that has developed and changed greatly over 
time (Fine 1993). This theoretical framework lost popularity in sociology in the 
1970s and was severely criticized for being nonempirical and lacking a structural 
emphasis; the work on accounts provoked a similar response. It may also be the 
case that since the initial developers of accounts (Burke, Goffman, Garfinkel, 
Mills, and Scott & Lyman) hardly mentioned how the concept should be oper- 
ationalized and empirically tested, the narrower but closely related concept of 
attribution (see Crittenden 1983), rather than accounts, received far more empir- 
ical attention in the 1970s and into the 1980s. More recently, however, accounts 
and other related concepts have been tested usefully via different methods, both 
in sociology and in related fields (e.g. Baumeister & Newman 1994, McAdams 
et al 1996, Orbuch et al 1993, Veroff et al 1993c). Finally, and quite significant 
to whether greater numbers of sociologists embrace the concept of accounts, is 
the currently wider recognition and acceptance by sociologists and other social 
scientists of postpositivist, poststructuralist perspectives, which have reopened 
the door to more interpretive, qualitative, and meaning-centered research. 
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CONCLLSION 


The foundation is set for a resurgence of work on accounts in sociology. I have 
reviewed early sociological writing that played an important role in defining 
and conceptualizing the term account. In the early analyses, the phenomenon 
of accounts generally was conceptualized as verbal explanations, presented in 
interaction, for potentially blameworthy or unanticipated behavior. Sociologists 
embraced this concept; it played a central role in the development of theory 
and research on individuals' search for and presentation of justifications and 
explanations of their social behavior. 

More recent work envelops a broader concept of accounts-as-stories to en- 
compass both private and public explanations for a wider array of social actions 
(which include but are not limited to potentially negative acts). This work also 
highlights the social process of account-making, the interactive nature of confid- 
ing the account to others, and the functions of accounts and account-making for 
individuals’ lives. A variety of theoretical frameworks and related concepts in 
communicated language, deviance, attributions, symbolic interactionism, and 
narratives have also motivated and influenced theory and research on accounts. 

Many social scientists believe that narration or account-making is at the 
heart of all social behavior (Bruner 1990, Coles 1989, Polkinghorne 1988). 
Yet despite the wide enthusiasm for the use of stories in scientific inquiry, few 
sociologists have addressed or employed this broad concept in their empirical 
research. It has flourished in other disciplines including psychology, anthro- 
pology, and communication studies. On the other hand, I propose that there 
are perhaps "hidden" accounts in the work of many sociologists. The implicit 
nature of accounts is apparent in much of sociology. Many of the qualitative 
approaches to interviewing, especially in-depth interviewing techniques (e.g. 
Mishler 1985, Weiss 1994), as well as ethnographies, autobiographies, artful 
science, and ethnomethodology, could be considered methods of collecting ac- 
counts (Alder & Alder 1987, Barbre et al 1939, Bochner et al 1997, Denzin 
19906, Ellis & Bochner 1997). 

The accounts concept, which has a solid foundation in early sociological work 
is quite valuable and relevant to sociologists’ interests and goals. The concept 
is useful for gaining insight into the human experience and arriving at meanings 
or collective understandings of other cultural, gender, or ethnic groups. This 
concept deserves greater explicit attention in mainstream sociology. Broadly 
construed, the ideas of accounts and account-making may be viewed as central 
to the enterprise and endeavors of contemporary sociology. “From time to time 
sociologists might well pause from their ongoing pursuits to inquire whether 
their research interests contribute in any way to the fundamental questions of 
sociology" (Scott & Lyman 1968, p. 46). 
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ABSTRACT 


Sociology has recently witnessed a rapprochement between research on organiza- 
tions and research on law. This essay reviews a number of central developments 
and tendencies in this emerging literature, with a particular emphasis on the char- 
acteristics of law as an element of the organizational environment. We begin by 
distinguishing two metatheoretical perspectives on law and organizations: the ma- 
terialist perspective, which portrays organizations as rational wealth-maximizers 
and sees the law as a system of substantive incentives and penalties; and the 
cultural perspective, which portrays organizations as cultural rule-followers and 
sees the law as a system of moral principles, scripted roles, and sacred symbols. 
Within each of these traditions, we examine three distinct facets of organizations’ 
legal environments: the facilitative environment, in which law passively provides 
an arena for organizational action; the regulatory environment, in which law ac- 
tively seeks to control organizational behavior; and the constitutive environment, 
in which law defines the basic building blocks of organizational forms and in- 
terorganizational relations. Finally, we distinguish between literature that sees 
law as an independent variable, determining organizational behavior; literature 
that sees law as a dependent variable, determined by organizational behavior; and 
a growing literature that discusses the endogeneity between law and organiza- 
tions. Although any taxonomy tends to depict clear lines where they are in fact 
murky, we think that this approach not only provides a means of categorizing the 
literature on law and organizations, it also calls attention to the many different 


ways in which law and organizations are dynamically intertwined. 
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Introduction 


Classical social theorists saw the sociology of law and the sociology of or- 
ganizations a3 intimately intertwined. To Marx, each historical epoch gener- 
ated distinctive legal forms that simultaneously reflected and reproduced that 
epoch’s fundamental organization of production (Marx & Engels 1977, Cain 
& Hunt 1979). To Durkheim (1964), the division of labor in society estab- 
lished the functional imperatives underlying prevailing legal institutions. To 
Weber (1947), legal-rational organizational structures and rationally organized 
legal structures embodied kindred tensions between efficacy, predictability, and 
domination. Yet, despite these early linkages, subsequent scholarship in the 
sociology of organizations and in the sociology of law developed for the most 
part independently. A few key theorists (e.g. Selznick 1969, Hurst 1964, Stone 
1975) attempted to bridge the gap between these subfields, but most research 
in each tradition remained largely oblivious of the other. 

In recent years, however, the conceptual separation between organizational 
and legal sociology has become increasingly difficult to maintain. Modern or- 
ganizations are immersed in a sea of law. They are born through the legal act of 
incorporation, and they die through the legal act of bankruptcy. In between, they 
raise capital under securities law, hire employees under labor and antidiscrim- 
ination law, exchange goods and services under contract law, develop public 
identities under trademark law, innovate under patent and copyright law, and 
engage in production under environmental, and health and safety law. Specific 
bodies of legal rules define, protect, and regulate many industries and most 
professions. Even the basic distinctions between "private," “public,” and “non- 
profit” are in part legal constructs. Moreover, the legal environment seems to 
have become more pervasive (or at least more visible) with the passage of time, 
as organizations face increased levels of litigation, regulation, and certification 
in many areas of activity.! 

Modern law, for its part, is awash in a flood of organizations. Law often 
emerges from the importunings of social movement organizations, charitable 
foundations, and corporations, and it is erased at the behest of such organized 
interests as well. In between, it develops meaning through its interpretation 
by organized professions, and it develops substance through its application 
by organizational compliance officers. It finds force in organizations’ litiga- 
tion decisions, and it realizes limits in organizations’ buffering and decoupling 


l'This observation should not be read as a critique Research suggests that organizations therm- 
selves have generated—and benefited from—a great deal of the recent rise in litigation, regulation, 
and certification. Managerial grumbling notwithstanding, the law is an invited guest, not an intruder, 
in most precincts of organizational life. 
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strategies. Moreover, just as organizational life seems to have become more 
legalized, legal life also seems to have become more bureaucratized, with or- 
ganizational actors assuming increasingly pivotal roles in turning the wheels of 
justice. 

In light of these trends, it is hardly surprising that sociology has begun to 
experience a quiet reconvergence of research on law and research on organi- 
zations. This rapprochement is still in its early stages, and much important 
conceptual work remains to be done. Nonetheless, the broad outlines of the 
endeavor are beginning to emerge. In the hope of accelerating these intel- 
lectual developments, this essay reviews and synthesizes some of the most 
significant strands in the still-disparate literature on the legal environments of 
organizations. 

Before beginning this exploration, it is important to note a number of top- 
ics that we do not cover here, despite their relevance to the general subject of 
law and organizations. First and perhaps most notably, although many of our 
observations may apply to criminal as well as civil matters, we review nei- 
ther the large criminological literature on white collar crime (e.g. Coleman 
1989, Wheeler et al 1982; see Braithwaite 1985 for a review) nor the smaller 
(but growing) literature on “organizational crime” (e.g. Cullen et al 1987). 
Further, we focus primarily on the relationship between the law and nonlegal 
organizations, and we do not address the large literature on the organizational 
characteristics of legal institutions such as legislatures, regulatory agencies, 
courts, law firms, police departments, and prisons. We also focus primarily on 
those legal policies that operate through judicial and quasi-judicial enforcement 
mechanisms, and we therefore intentionally exclude spending, taxation, and the 
direct provision of goods and services by nonjudicial state agencies. Finally, 
due to space limitations, we largely confine our attention to the sociological 
literature, and we therefore do not systematically cover the sizable parallel lit- 
eratures in political science and “law and economics.” The following pages, 
then, examine sociological accounts of how organizations that are formally out- 
side the legal realm relate to the civil laws and regulations that they encounter 
in their social environments. While this scope does not encompass every pos- 
sible body of relevant thought, it nonetheless covers quite a wide range of 
material. 


Dimensions in the Analysis of Law and Organizations 

Our review divides the literature into two metatheoretical perspectives on law 
and organizations. The first might be labeled a rational materialist perspec- 
tive. Prominent in both classical studies of organizations and the newer law 
and economics tradition, this perspective sees organizations as rational wealth 
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maximizers and sees the law as a system of substantive incentives and penal- 
ties (for a recent survey of law and economics, see Cooter & Ulen 1996) 
Thus, organizations instrumentally invoke or evade the law, in a strategic ef- 
fort to “engineer” legal activities that bring the largest possible payoff at the 
least possible cost. In opposition to this materialist perspective, a somewhat 
smaller camp (primarily composed of neoinstitutional sociologists) offers what 
could be called a normative cultural alternative (for reviews of neoinstitutional 
theory, see Powell & DiMaggio 1991, Scott 1987, Scott & Meyer 1994, Scott 
1995, Suchman & Edelman 1996). Here, material costs and benefits are, at 
best, a secondary concern. Instead, this perspective sees organizations as cul- 
tural rule-followers and sees the law as a system of moral principles, scripted 
roles, and sacred symbols.’ Thus, organizations look to the law for normative 
and cognitive guidance, as they seek their place in a socially constructed cul- 
tural reality. Law provides a model of and for organizational life, defining roles 
for organizational actors and meanings for organizational events—and imbuing 
those roles and meanings with positive or negative moral valence (cf Geertz 
1983). If materialist accounts speak in the language of game theory, deter- 
rence theory, and transaction cost analysis, culturalist accounts invoke labeling 
theory, symbolic management theory, and institutional analysis. 

The material-cultural dichotomy is not the only noteworthy split in the litera- 
ture on law and organizations. À second set of divisions arise because the legal 
environment is multifaceted, and different research traditions emphasize differ- 
ent aspects. Specifically, organizations encounter law in at least three distinct 
guises, which we here call the facilitative, regulatory, and constitutive legal 
environments, respectively (cf Suchman 1993a, Scott 1993).4 

As a facilitative environment, law provides an exogenously generated, but 
fundamentally passive set of tools and forums, which managers actively employ 
to accomplish various organizational goals. This is the world of the lawsuit, 
the public hearing, and the Freedom of Information request, as well as of the 


2Some models relax this clam by positing that organizations are merely bundles of quasi- 
contractual relations between rational wealth-maxumuzing individuals (Alchian & Demsetz 1972, 
Jensen & Meckling 1976). Others add the possibility that when information 1s costly, rational 
maximizing may give way to boundedly rational satisficing (March & Simon 1958, Arrow 1974, 
Williamson 1975). Nonetheless, the general emphasis on materially driven, instrumental action 
remains at the core of all such accounts. 

*Many culturalist models emphasize that legitimation processes can give organizations a ma- 
terial interest in at least superficial conformity (Meyer & Rowan 1977, Dowling & Pfeffer 1975, 
for a review, see Suchman 1995b) While such accounts clearly allow some room for rational 
decision-making dynamics, they nevertheless continue to accord a central place to culturally driven 
symbolic action. 

‘Broadly, the facilitative, regulatory, and constitutrve environments in our analysis correspond 
to procedural law, substantive law, and definitional law, respectively, in Suchman (1993a) and Scott 
(1993) 
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numerous informal maneuvers that arise “in the shadow of the law" (Mnookin & 
Kornhauser 1979). Here, law appears as a system of procedural rules, furnishing 
legal vehicles for organizational initiatives that might otherwise occur through 
market tactics, media campaigns, industrial espionage, violent self-help, etc. 
In facilitative law, organizations are the players, and the legal system is merely 
an arena—albeit an arena whose shape may dramatically affect the course of 
the game.? 

The regulatory environment, in contrast, places law in a far more active 
posture. This is the world of antidiscrimination laws, health and safety laws, 
antitrust laws, environmental protection laws, and the like. Here, law appears as 
a system of substantive edicts, invoking societal authority over various aspects 
of organizational life. The rhetoric, if not the reality, of regulation is one of 
top-down sovereign control: The legal system (presumably on society's behalf) 
is taking the initiative directly to modify organizational behavior. 

The third facet of law—the role of law as a constitutive environment—is at 
once more subtle and more profound than the other two. Here, the legal system 
constructs and empowers various classes of organizational actors and delineates 
the relationships between them. Thus, if the facilitative legal environment is 
comprised primarily of procedural rules, and the regulatory legal environment 
is comprised primarily of substantive edicts, the constitutive legal environment 
is comprised primarily of definitional categories—those basic typologies that 
identify the legally cognizable components of the social world and that explain 
the natures and attributes of each. Constitutive law generally functions almost 
invisibly, providing taken-for-granted labels, categories, and "default rules" for 
organizational behavior; however, by establishing the background understand- 
ings that frame social discourse, constitutive law helps to determine what types 
of organizations come into existence and what types of organizational activity 
gain formal recognition. Thus, for example, the constitutive legal environment 
describes how various classes of organizations are born and how they die; it 


5We use the term “facilitative” not only because this facet of law "facilitates" certain orga- 
nizational strategies, but also (and primarily) because the word hints at law’s ability to provide 
"facihties" for organizational action Patent infringement suits, for example, may not facilitate 
organizational activity on the part of the defendant corporations, but they do offer a forum in 
which the parties can build competitive advantages over—or cooperative relations among—-one 


another. 
5It ıs perhaps worth noting that the regulatory legal environment, as we define it here, is not 


precisely coterminous with either lay or legal conceptions of “regulation” Seen as the invocation of 
societal authority over organizational life, the regulatory legal environment includes common law 
as well as statutes, but ıt largely excludes laws (or parts of laws) that establish operating procedures 
for administrative agencies or that define categories of regulated activity (In our typology, these 
legal elements would be classified as facilitative and constitutive, respectively.) Further, although 
we do not specifically address general criminal law in this essay, most such material would fall 
within our definition of the regulatory environment, broadly understood. 


484 EDELMAN & SUCHMAN 


defines whether particular interorganizational relationships are to be understood 
as transitory or open-ended; and it establishes which organizations can act as 
corporate “persons” and which can act only as collections of separable interests. 
In short, although constitutive law often seems more placid and routine than 
facilitative and regulatory law, this aspect of the legal environment provides the 
fundamental definitional building blocks that undergird the other two. 

As well as being separated by metatheoretical orientation (material/cultural) 
and by focal environment (facilitative/regulatory/constitutive), writings on law 
and organizations also vary in the nature of the causal questions that they ask. 
Most investigations, especially within the sociology of organizations, treat law 
as an independent variable, and examine how exogenously established legal 
conditions affect the practices and outcomes of various organizational entities. 
Other analyses, particularly within the sociology of law, treat law as a depen- 
dent variable, and ask how exogenously determined organizational activities 
contribute to the formulation and implementation of various legal initiatives. 
Most recently. a third causal imagery has begun to emerge, primarily among 
scholars working at the border between law and organizations. Rather than 
treating either legal or organizational events as purely exogenous, this approach 
highlights the endogeneity of both organizations and their legal environments, 
arguing that organizations construct and configure legal regimes even as they 
respond to them. 

The divisions in the literature are not entirely independent. Materialist ac- 
counts, for example, tend to focus on the facilitative and regulatory legal en- 
vironment, whereas culturalist accounts tend to focus on the regulatory and 
constitutive environment. In addition, materialists are somewhat more likely 
to treat law as purely dependent or purely independent, whereas culturalists 
are beginning to attend more closely to endogeneity. The following pages re- 
view first materialist and then culturalist writings, in both cases exploring how 
metatheory directs attention toward particular facets of the legal environment 
and toward particular types of causal argument. As becomes clear, however, 
the literature on law and organizations encompasses a wide range of issues and 
perspectives, and it often defies easy categorization. 


Materialist Approaches 


Materialist approaches to the relationship between law and organizations trace 
their intellectual ancestry back to Beccaria (1988), Marx (Marx & Engels 1964), 
Holmes (1897. 1881), and Weber (1978). In keeping with this heritage, recent 
contributors come not only from sociology but also from economics, politi- 
cal science, and law, and many of their writings conjoin abstract theory and 
empirical description with substantial doses of prescriptive policy analysis. In 
broad outline, materialists see the law as a system of concrete penalties and 
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rewards, designed to deter some lines of action and to encourage others. Or- 
ganizations, for their part, respond as self-interested rational actors, making 
calculated decisions about when and how to influence, invoke, obey, and evade 
the legal system in order to avoid law’s costs and exploit law’s benefits. Because 
of its emphasis on material sanctions, this approach tends to assume that legal 
rules will be most efficacious (and therefore most worthy of study) when they 
are substantive and unambiguous; because of its emphasis on organization-level 
rational action, this approach tends to assume that organizational identities, in- 
terests, and capacities exist independent of the law. Consequently, materialists 
generally downplay endogenous and constitutive processes. Instead, they focus 
primarily on how facilitative and regulatory legal apparatuses emerge from and 
subsequently determine organizational activity. 


Materialist Perspectives on the Facilitative Environment 


The legal system provides tools and forums for a wide range of organiza- 
tional activities, from dispute resolution (Westin & Feliu 1988, Ewing 1989) 
to information gathering (Casey et al 1983, Farrell 1984) to coalition building 
(McCann 1991, 1994); however, perhaps the most frequently studied aspect of 
the facilitative legal environment is litigation.’ Business litigation rates have 
risen dramatically since the early 1970s, in part due to increased litigation by 
individuals against firms, but largely due to an increased willingness of corpo- 
rations to sue one another (Cheit 1990, 1991, Dungworth et al 1990, Galanter & 
Rogers 1991, cf Wanner 1974). This change has brought with it both an expan- 
sion of in-house counsel offices (Chayes & Chayes 1985, Galanter & Rogers 
1991) and a burgeoning of independent corporate law firms (Galanter & Palay 
1991, Nelson 1988). Thus, litigation is becoming an increasingly salient aspect 
of day-to-day organizational operations. 

Significantly, although research on individual-level litigation often high- 
lights symbolic and expressive factors (see, e.g., Felstiner et al 1980-1981, 
Danet 1980, Lind & Tyler 1988, Mather & Yngvesson 1980—1981), organiza- 
tional research almost invariably adopts a more materialist stance, emphasizing 
the strategic choices that organizations make in using lawsuits as tools of di- 
rect (nonmarket) competition (e.g. Priest & Klein 1984, Cheit 1991:129—30). 
Much of this literature emerges out of policy debates surrounding litigation 
reform. Such studies generally treat procedural rules as exogenous variables 
affecting the costs and benefits (and hence the form and frequency) of organiza- 
tional disputing. Standing-to-sue doctrines (Snyder 1985, Alpert 1988, Rabkin 
1989), class-action requirements (Scott 1975, Yeazell 1987), contingency fee 


TThis having been said, it is also worth noting that litigation involving businesses appears 
radically understudied when compared to litigation involving individuals (Cheit 1991, Hayden 
1989) 
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arrangements (Olson 1991, Donohue 1991, Gravelle & Waterson 1993), and 
jurisdictional rules (Bumiller 1980-1981) have ell attracted attention as deter- 
minants of litigation behavior—both litigation by organizations and litigation 
against them (see, generally, Cotterrell 1992:250—57, Cappelletti 1989). Fur- 
ther, several observers have posited that aggregate changes in litigation may 
produce a wide array of systemic side-effects, including increased insurance 
use (Cheit 1991:127), elevated bankruptcy rates (Delaney 1989), and a soci- 
etal undersupply of high-risk innovation (Cheit 1991:130; see generally, Huber 
1988). In short, as independent variables, procedural rules determine how at- 
tractive court-oriented disputing will be, and this in turn determines whether or 
not the facilitative legal environment will absorb organizational energies that, 
for better or worse, might otherwise be directed elsewhere. 

Alongside such analyses of facilitative law as a determinant of organizational 
behavior, other materialist researchers have reversed the causal arrow, to explore 
the ways in which organizational behavior may shape this legal environment in 
return. Formally, at least, courts are passive institutions, and although procedu- 
ral rules may encourage or discourage the filing of court papers, most litigated 
disputes originate outside the judicial arena. As Galanter & Rogers (1991) 
observe, lawsuits are only one of several ways that organizations can govern 
their interactions. Others include: markets (Williamson 1975, 1981), state reg- 
ulation (see below), community norms (Macaulay 1963, Ellickson 1986), labor 
organizations {Vittoz 1987, Rogers 1990), and trade associations (Streeck & 
Schmitter 1985, Aldrich et al 1994). If litigation arises, in part, from the failure 
of these alternative mechanisms, then extra-legal organizational conditions— 
such as competition, product specialization, relational duration, deal complex- 
ity, economic globalization, and economic instability—may play a crucial role 
in shaping the work of the courts (Galanter & Rogers 1991:41 ff). 

This influence may extend beyond the rate of litigation to include the na- 
ture of litigation as well. Strategic decisions about when and how to mobilize 
the facilitative legal environment can, in the aggregate, produce lasting shifts 
in underlying legal rules (Priest & Klein 1984). Moreover, as well-endowed 
“repeat players," organizations are often uniquely capable of using litigation 
to conform the law to their needs (Galanter 1974, Wanner 1975, Wheeler et al 
1987, but cf McIntosh 1985). This is perhaps clearest with regard to substan- 
tive principles of liability and remedy; however, it applies with equal force to 
procedural principles of access, initiative, and proof as well. Thus, over time, 
accumulated pressures from new forms (or new levels) of litigation may alter 
the very practices that define the disputing arena in the first place (Galanter 
1990, Yeazell 1987).* In the long run, both the docket and the structure of the 


®The impact of strategic action on disputing procedures may not always be so gradual and inad- 
vertent. More critical analyses tend to argue that procedural law, like substantive law, often responds 
quite directly to the interests of powerful elites in civil society (e g Chambliss & Seidman 1982). 
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court system are likely to reflect material conditions and economic concerns in 
the larger organizational world. 


Materialist Perspectives on the Regulatory Environment 


Perhaps the largest branch of the materialist literature addresses not the use of 
facilitative law by organizations, but rather the imposition of regulatory law 
on organizations. From this perspective, the legal environment consists of a 
system of incentives and penalties, designed largely to compensate for market 
imperfections such as monopolies and externalities (Friendly 1962, Burk 1985). 
Law, in this view, is justified by the need for collective societal intervention 
into certain areas of organizational life, but it is constantly in danger of being 
captured by its targets and turned against the common good that it supposedly 
defends. 

Within a materialist perspective, analyses of regulatory law as an indepen- 
dent variable generally seek to determine how specific legal sanctions im- 
pinge on the strategic choices of rational organizational decision-makers (e.g. 
Peltzman 1976, Diver 1980, Scholz 1984, Paternoster & Simpson 1993, 1996, 
Braithewaite & Makkai 1991, Genn 1993). The regulatory regime, in this view, 
is simply one element among many in the larger managerial cost-benefit picture. 
Significantly, however, regulatory costs differ somewhat from market costs, par- 
ticularly in the degree to which organizations can escape or influence them. As 
a result, materialist accounts of regulation often argue that self-interested or- 
ganizations respond not only to the incentives and penalties enunciated by law 
“on the books,” but also to the real-world gaps provided by ambiguous statutes 
and manipulable regulatory agencies. Thus, while some theoretical treatments 
assume that regulation is explicit, authoritative, and coercive (for a review, see 
Suchman & Edelman 1996), empirical analyses generally suggest that this “‘le- 
gal formalist" imagery is at best an abstract ideal (e.g. Macaulay 1963) and 
that, in reality, the regulatory environment is often ambiguous, contested, and 
riddled with loopholes. Although few materialists would rule out the possibility 
of successful deterrence (see e.g. McCaffrey 1982, Scott 1989, Paternoster & 
Simpson 1993), the overall picture is one of noncompliance, subversion, and 
evasion (e.g. Kagan & Scholz 1983). 

Truly rational organizations are difficult to regulate in part because they lack 
the moral sentiments that promote individual-level obedience to law (cf Tyler 
1990), and in part because their structures often insulate the managers who engi- 
neer lucrative transgressions. Christopher Stone's Where the Law Ends (1975) 
points out that since corporations cannot be imprisoned, and since individual 
employees are rarely held responsible for corporate wrongdoing, the corpo- 
rate pocketbook often becomes the primary target of efforts at social control. 
Legal sanctions, however, are usually too small and too slow to affect ratio- 
nal organizational planning. Stone also suggests that common organizational 
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characteristics such as task specialization, decentralized decision-making, and 
interdepartmental competition often marginalize legal requirements and encour- 
age noncompliance (see also Jowell 1975). Similarly, Katz (1977) shows how 
corporate officers can effectively mask regulatory violations from outside re- 
view (see also Ekland-Olson & Martin 1988). Perhaps for these reasons, recent 
research suggests that conventional deterrence theory does a relatively poor job 
of predicting managerial responses to regulatory law (Braithewaite & Makkai 
1991, Makkai & Braithewaite 1994, Simpson & Koper 1992, Paternoster & 
Simpson 1996; but see Block et al 1981)? 

In the materialist view, the regulatory legal environment not only confronts 
recalcitrant self-interested organizations, it also often suffers from internal com- 
plexities and contradictions. While law creates sanctions for certain behaviors, 
it also creates defenses and loopholes that rational organizations may exploit to 
escape those sanctions (Mashaw 1979). As a result, the history of regulation is 
replete with unintended consequences. Examples from the literature include: 
the use of vertical integration as a means of evading sales tax (Stigler 1968); 
the spread of poison-pill takeover defenses as a buffer against the market for 
corporate control (Davis 1991, Powell 1993); the proliferation of uninforma- 
tive disclaimers as an alternative to meaningful securities disclosure (Samuels 
1996); the creation of wholly owned subsidiaries as a “liability firewall" (Barney 
et al 1992); and utilities meeting local pollution standards not by reducing ef- 
fluents but by building tall smokestacks, thereby transporting their emissions 
to distant regions (Ackerman & Hassler 1981). The common theme behind all 
of this research is that the regulatory legal environment may exert its largest 
effects not by motivating compliance, but rather by motivating evasion.!? 

To a substantial extent, the complexity of real-world regulation stems from 
the fact that few (1f any) regulatory regimes actually emerge independently 
of the organizational actors whom they ostensibly govern. This insight forms 
the basis for a large body of work on the regulatory environment as a de- 
pendent variable. In particular, several streams of research have explored the 


9? As we discuss below, regulatory deterrence theory may also perform poorly because it ex- 
aggerates the extent to which organizations behave as fully rational actors, free from normative 
and cognitive cultural constraints. Stone (1975) hints at such considerations when he advocates 
creating internal monitoring roles to bring the voice of conscience into corporate decision-making. 

10Whule this literature raises doubts about the ability of regulation to deter organizational wrong- 
doing, a handful of studies also question whether organization-level rationality is necessarily the 
best metaphor for understanding the material effects of the regulatory environment. In some 
settings, regulaticn's primary consequence may be to empower intraorganizational interest groups 
(Pfeffer 1981, Edelman et al 1992) or to mfluence founding and failure rates 1n super-organizational 
populations (Wholey & Sanchez 1991) If so, then organization-level evasions may prove largely 
epiphenomenal. 
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autonomous impact of “regulated” organizations on both the legislative and the 
administrative process. 

With respect to legislation, research suggests that most legal initiatives emerge 
out of complex networks of power, in which organizations are central to all 
sides of the debate (Laumann & Knoke 1987, Perrucci & Potter 1990). Re- 
cent studies of social movements, for example, clearly document the extent to 
which formal organization has become a key determinant of political success 
(McCarthy & Zald 1977, Zald & McCarthy 1980, Wolfson 1995). At the same 
time, other investigations equally clearly document the extent to which tar- 
get organizations, themselves, band together into countermovements or “‘action 
sets,” often under the aegis of trade associations, industry working groups, and 
the like (Aldrich & Whetten 1981). Both individually and in collaboration, 
such organizations make substantial investments in electoral politics and in 
backstage agenda-setting (Lukes 1974, Domhoff 1983).!! In a study of energy 
and health care policy, for example, Laumann & Knoke (1988) find that large 
organizations dominate the national decision-making process, often playing a 
greater role than do elected representatives. Even the frequently noted perva- 
siveness of lawyers in national policy debates is less a source of autonomy for 
the legal system than a source of influence for those lawyers’ corporate clients 
(Nelson et al 1988). 

Although popular accounts often assume that organizations enter the political 
process primarily to minimize government "interference," empirical evidence 
argues to the contrary. In many cases, organizations actively seek regulation as a 
means of controlling their environments or obtaining advantages over their com- 
petitors. Historically, for example, the railroad industry has lobbied for weight 
and size limitations on interstate trucking (Stigler 1971); the National Associa- 
tion of Manufacturers has lobbied for restrictive labor legislation (Gable 1953); 
and the medical, dental, and legal professions have lobbied for licensure statutes 
that artificially depress the supply of practitioners (Holen 1965, Pfeffer 1974, 
Zhou 1993). Inlight of this evidence, Posner (1974) goes so far as to suggest that 
regulation, like other commodities, may be supplied at a price and may face a 
demand distribution. Certainly, a great deal of lawmaking seems to respond pri- 
marily to the parochial interests of targeted firms, industries, and sectors (Downs 
1957, Kolko 1963, Fiorina & Noll 1976, Sabatier 1975, McCormick 1979). 

Moreover, the ability of organizations to shape the legal environment does not 
end with the passage of legislation but rather continues on into the implemen- 
tation phase as well. In a significant departure from the "explicit, authoritative 


11 Organizations also make substantial investments in litigation strategies designed to yield fa- 
vorable rulings on points of statutory ambiguity (Galanter 1974) 
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and coercive” ideal, legislative compromises often yield statutes that either 
provide little regulatory guidance, or demand impossible regulatory results, or 
both (Clune 1983, Hawkins 1984, Baier et al 1988, Blumrosen 1993).!? Conse- 
quently, administrative agencies enjoy a great deal of discretion in rule-making 
and enforcement, and regulators can become politicized both on the basis of 
personal preferences and on the basis of bureaucratic agendas (Luchansky & 
Gerber 1993, Newman & Attewell 1985, Jowell 1975). To avoid alienating 
powerful constituencies, regulatory agencies generally avoid full use of their 
statutory powers (“hard enforcement") and instead favor informal enforcement 
methods such as education, persuasion, negotiation, and publicity (Hawkins 
1984, Bardach & Kagan 1982, Cranston 1979, cf Vogel 1986). 

In addition, owing to poor funding, public agencies often rely on indus- 
try for information, personnel, and technical expertise, further heightening 
the influence of ostensibly subordinate organizations on the regulatory pro- 
cess (Leiserson 1942, Bardach 1989, Breyer 1982, Makkai & Braithewaite 
1992). Materialist models often refer to such bottom-up dynamics as "regu- 
latory capture" (Bernstein 1955), highlighting the ability of regulated indus- 
tries to ensure supportive policies through power-dependence relations with 
key agencies and key agency personnel (cf Pfeffer & Salancik 1978, but see 
Levine 1981, McCaffrey 1982, Freitag 1983). Thus, far from imposing ex- 
ternal social control on organizational deviance, the regulatory legal environ- 
ment often merely institutionalizes the indigenous practices of the regulated 
population itself (Blumrosen 1965, 1993, Wirt 1970, Ackerman et al 1974, 
Katz 1977, Conklin 1977, Cranston 1979, Diver 1980, Clune 1983, Vaughan 
1983, Hawkins 1984; but see Levine 1981, Horwitz 1986, Luchansky & Gerber 
1993). 

The materialist perspective on regulatory law, then, suggests that state- 
imposed penalties and incentives may do less to deter wrongdoing than to 
motivate circumvention and manipulation. In this sense, the regulatory envi- 
ronment merges somewhat with the facilitative environment, as organizations 
employ regulatory tools to reconfigure their markets and to divert attacks on 
their autonomy. 


Materialist Perspectives on the Constitutive Environment 


As suggested above, because of its emphasis on concrete sanctions and rational 
choice, the materialist perspective tends to downplay law’s more constitutive 
aspects. Nonetheless, at least one strand of this literature does address the 


Indeed, one plausible way to understand ambiguous statutes is as devices for overcoming 
legislative contention by implicitly allowing each side to “make a bet" on the outcome of subsequent 
interpretation This technique has the political appeal of permitting the ultimate loser to plead good 
1ntentions and to decry the sorry perversion of "legislative 1ntent " 


LEGAL ENVIRONMENTS OF ORGANIZATIONS 49] 


role of legal categories, capacities, and “default rules” in organizational life— 
albeit in a way that owes more to information economics (Stigler 1961) than 
to social phenomenology (Berger & Luckmann 1967, see below). This is the 
“transaction cost analysis” tradition, most often associated with institutional 
economists such as Coase (1937, 1960), Williamson (1975, 1981, 1985), and 
North (1981), but also reflected in more sociological work on industrial structure 
and governance (e.g. Campbell & Lindberg 1990, Campbell et al 1991). 
Building on the path-breaking work of Ronald Coase (1937, 1960), trans- 
action cost analysis argues that organizations are best understood as devices 
for efficiently governing economic relations when markets fail due to uncer- 
tainty, bounded rationality, monopoly, and opportunism (Williamson 1975, 
1981, 1985). Significantly, the incidence of market failures—and, hence, the 
need for organizations—-reflects not only technical factors and human frailties 
(cf Williamson 1975:40), but also the nature and clarity of the prevailing legal 
regime (Coase 1960, North 1981). Law-and-economics research, for exam- 
ple, has traditionally emphasized the ability of “efficient” rules to minimize 
transaction costs in the market and, by implication, to maximize the scope of 
markets versus organizational and regulatory hierarchies (e.g. Posner 1972). 
More recently, researchers have also noted that legal rules may operate on the 
other side of the ledger as well, providing rights and immunities that enhance or 
impede the efficiency of hierarchical governance mechanisms—and hence, that 
affect their desirability relative to markets (Masten 1990, Williamson 1991). 
All of this suggests that law can exert a substantial inituence on organizations 
by creating an encompassing framework of basic property rights." Campbell & 
Lindberg (1990), in particular, have argued that by failing to recognize the legal 
system's capacity to establish such rights, prior research may have grossly un- 
derestimated the role of the state (especially the American state) in the economy. 
In reality, these authors suggest, both the state's direct actions and its overall in- 
stitutional structure "define and enforce property rights [that determine] social 
relations, and, therefore, the balance of power among a wide variety of economic 
actors" (1990:636). The legal system, in this view, both creates and responds 
to pressures for governance change by modifying the rights regime. Further, 
once a range of new governance mechanisms has begun to emerge, property 
rights (both pre-existing and newly created) channel the social processes that 
select among competing alternatives. In particular, by ratifying (or refusing 
to ratify) certain arrangements, rights rulings effectively delimit the search for 
new solutions. Thus, law shapes the transaction costs to which organizations 
respond, and it determines the viability of various organizational responses. 


13 This literature tends to define “property nghts" broadly, to include enforceable legal claims that 
practicing lawyers would identify as being grounded in principles of contract, tort, antitrust, etc, as 
well as those grounded in the law of property, per se (see, e g., Campbell & Lindberg 1990.635) 
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Of course, as Campbell and Lindberg acknowledge (1990:643), changes in 
legal rights may be a consequence, as well as a cause, of organizational gov- 
ernance decisions. Within organizations theory, it is fairly well established that 
the strategies of firms and the structures of industries can exert lasting im- 
pacts on the distribution of transaction costs (e.g. Chandler 1962, 1977, 1990, 
Bernstein 1992, Suchman & Cahill 1996). This implies that such organiza- 
tional phenomena can also influence the efficiency of various legal regimes. 
Moreover, organizations may occasionally seek new “legal devices" to address 
specific governance problems (Powell 1993, cf Roy 1990). In Gilson's (1984) 
memorable phrase, corporate lawyers then act as "transaction cost engineers," 
assembling basic legal building blocks into novel social arrangements.!^ Over 
time, these hand-tailored organizational creations may diffuse and become pro- 
totypes for the more standardized “off the rack" objects of the constitutive legal 
environment (Powell 1993, Suchman 1994a, Klausner 1995).!5 

In sum, materialist perspectives tend to focus on organizations' strategic 
efforts to maneuver around and within the world of law. Motivated by interests 
in profits and efficiency, organizations seek, embrace, and exploit laws that 
promise economic rewards—and they oppose, evade, or ignore laws that impose 
economic burdens. In the materialist view, then, law is largely exogenous to 
organizations but greatly relevant to the rational choices that organizations 
make. Law may be a tool, it may be a commodity, it may be a system of costs 
and incentives, or it may be a set of coordinating ground rules; but it is virtually 
always an environment that organizations encounter concretely and contemplate 
rationally. This imagery changes considerably in cultural renditions. 


Cultural Perspectives 


Cultural perspectives on law and organizations are relatively recent arrivals on 
the intellectuzl scene, tracing their ancestry largely to the theoretical work 
of Philip Selznick (1948, 1949, 1957, 1969) and Peter Berger & Thomas 
Luckmann (1967). From Selznick's early work (1948, 1949, 1957) this literature 


This 15 not always a desirable development. The literature features several "innovations"— 
such as strategic bankruptcy (Delaney 1989) and legal tax evasion (McBarnet 1992)—that arguably 
ennch individual adopters at the expense of the larger economy. 

I5Whether or not these phenomena properly fall under the “constitutive” label represents a 
debatable point. If states intentionally adjust legal rights to accomplish policy goals, one might 
argue that such actions are regulatory in character, and if organizations strategically employ legal 
routines to resolve business problems, one might claim that the law's role 1s merely facilitative. 
Alternatively, however, one might also note (as do Campbell & Lindberg, 1990:635, 38) that rights 
regimes often change without a calculated policy impetus, aad that new governance mechanisms 
often spread without deliberate foresight. To the extent that these processes arise from the law's 
ability to lay the ground rules of an entire field of organizational activity —1including defining the 
boundaries, capacities, endowments, and relationships of the key participants—the claim that law 
18 constitutive of that field seems well justified. 
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inherits an interest in how the normative and political commitments of organi- 
zational participants (often stimulated by pressures from the external sociopo- 
litical environment) can lead certain ostensibly rational bureaucratic structures 
and practices to become "institutionalized"—or “infused with value beyond the 
technical requirements of the task at hand” (Selznick 1957:17). From Selznick's 
later work (1969, Nonet & Selznick 1978), the literature inherits a parallel in- 
terest in the ways in which societal principles of “legality” get reproduced (both 
reflected and perpetuated) through the enunciation of “citizenship rights” by the 
“private governments” of firms. Finally, from Berger & Luckmann (1967), the 
culturalist literature inherits an attentiveness to the importance of societal rule 
systems in constructing organizational forms—-not only in establishing norma- 
tive guidelines for good and bad organizational behavior, but also in establishing 
cognitive frameworks for meaningful and meaningless organizational behavior. 

As this heritage suggests, recent culturally oriented writings on the interface 
between law and organizations derive primarily from the “new institutionalism" 
in organizational analysis, a tradition that stresses the impact of cultural rules, 
models, and mythologies on organizational structures and practices (Meyer 
& Rowan 1977, DiMaggio & Powell 1983, Meyer & Scott 1983, Powell & 
DiMaggio 1991, Scott & Meyer 1994, Scott 1995). At the same time, however, 
many culturalist “law and organizations" studies also reflect the persistent in- 
fluence of the “law and society" movement, with its emphasis on the filtering of 
legal doctrines through concrete social institutions (e.g. Macaulay 1963, 1966, 
Hurst 1964, Galanter 1974, Mnookin & Kornhauser 1979, Friedman 1985; see 
Suchman & Edelman 1996 for an analysis of the relationship between organi- 
zational institutionalism and the law-and-society tradition). 

In contrast to the materialist view of law as a clearly delineated incentive 
structure that realigns organizational cost-benefit analysis, cultural perspec- 
tives see law as a pervasive belief system that permeates the most fundamental 
morals and meanings of organizational life: Law constructs and legitimates 
organizational forms, inspires and shapes organizational norms and ideals, and 
even helps to constitute the identities and capacities of organizational “actors.” 
Thus, whereas the materialist literature focuses on organizations' strategic ef- 
forts to evade and manipulate legal sanctions, culturalist approaches emphasize 
organizations' subtle but profound responsiveness to legal ideals, norms, forms, 
and categories. And whereas the materialist perspective sees law as shaping 
organizational behavior by creating incentives and penalties, the cultural per- 
spective suggests that organizations adopt certain structures and practices be- 
cause the socio-legal environment constructs those structures and practices as 
proper, responsible, legitimate, and natural. Moreover, in recent years, cul- 
tural theorists have begun to argue for a reverse causal dynamic, as well: At 
times, the legal system may bless certain organizational structures and practices, 
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in part, because extra-legal organizational communities have already endoge- 
nously constructed those structures and practices as sensible, efficient, feasible, 
or modern. Moreover, in many settings, legal and organizational understand- 
ings of rationality, propriety, and meaning coalesce simultaneously and endoge- 
nously, through an ongoing exchange of symbols and enactments, gestures and 
interpretations. 


Cultural Perspectives on the Facilitative Environment 


Because cultural accounts generally minimize the strategic motives of organi- 
zations, they are significantly less likely than materialist perspectives to focus 
on the legal tools and factlities available for instrumental use. One major ex- 
ception is the study of how organizations strategically deploy legal symbols in 
pursuit of sociopolitical legitimacy (Edelman 1990, 1992, Friedland & Alford 
1991). Because such strategies generally emerge in response to regulatory laws 
or societal normative pressures, however, we discuss this phenomenon in the 
section on the regulatory environment, below. 

Cultural perspectives on law as a facilitative environment do come into 
play, though, in literatures that address the cultural components of disput- 
ing. Whereas materialist perspectives focus on organizations' use of legal 
procedures as tools of competition, cultural perspectives emphasize the rela- 
tionship between styles of dispute resolution and "softer" organizational phe- 
nomena such as inter-organizational cooperation, community-building, public 
relations, and reputation. The focus here is often on alternative dispute resolu- 
tion (ADR)—methods of conflict handling that have developed in response to 
the problems of litigation, such as its cost in money and time, its inflexibility, 
and its tendency to destroy relationships. 

The literature on ADR by organizations primarily addresses law as an inde- 
pendent variable, in the sense that the costs of litigation—both financial and 
relational—are seen as the primary motivation for employing various ADR tech- 
niques. Thus, ADR has enjoyed growing legitimacy in the business community 
as a means of smoothing over workplace conflict, cooling out grievances, and 
avolding lawsuits (Edelman et al 1993, Morrill 1995, Lande 1995, Edelman 
et al 1996, Edelman & Cahill 1998).!6 In addition to the academic literature 
on this topic, there is also a large (and largely atheoretical) practitioner litera- 
ture, advocating ADR as preferable to litigation, primarily because it ostensibly 
avoids the costs of litigation and gives firms greater control over the disputing 
process by relaxing formal due process protections (e.g. Westin & Feliu 1988, 
Ewing 1989). 


‘6The critical view, which sees ADR as a means by which employers seek to control employees, 
18 based on a literature in the sociology of law that critiques ADR ın the context of disputes 
between individuals (e.g. Silbey & Sarat 1989, Delgado et al 1985, Cobb & Rifkin 1991, Fiss 1984, 
Hofrichter 1987) 
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The ADR literature also points to law as a dependent variable by suggesting 
ways in which the organizational embracing of ADR injects new values and 
principles into the legal system. Professional ideologies within organizations 
play a large role in determining the extent to which organizations use ADR as 
opposed to litigation (Lande 1995). And the internal legal culture of organiza- 
tions, in turn, affects the extent to which employees mobilize both formal legal 
rights and internal dispute resolution procedures in resolving grievances against 
their employers (Fuller et al 1997). Further, in the process of using ADR, orga- 
nizational grievance handlers tend to infuse legal norms with managerial values, 
such as the importance of smooth relations, the therapeutic underpinnings of 
most disputes, and the importance of building relationships and preserving 
community. Over time, these values tend to influence the resolution of analo- 
gous disputes in the formal legal system, and even to be incorporated into the 
official construction of the law (Edelman et al 1993, Edelman & Cahill 1998). 
Many new employment laws, such as the 1991 Civil Rights Act and the 1990 
Americans With Disabilities Act, in fact explicitly encourage the use of ADR 
to resolve legal claims. 


Cultural Perspectives on the Regulatory Environment 


In contrast to the materialist approach, culturalist theories of the regulatory 
environment place less emphasis on the role of legal sanctions in deterring un- 
desirable behavior, and more emphasis on the role of legal symbols in evoking 
desirable normative commitments. Regulatory law, in this view, is less a threat 
than a sermon. One of the most salient consequences of this shift appears with 
respect to the issue of compliance: Unlike materialist accounts, which often 
focus on organizations’ strategic attempts to evade external control, culturalist 
perspectives stress that even as firms maneuver around particular legal rules, 
they cannot escape the law’s larger normative influence. Further, because cul- 
turalists focus on law as a source of symbol and meaning, rather than as a 
source of coercive constraint, these researchers tend to emphasize the diffuse 
and indirect consequences of the legal environment as much as its targeted and 
direct effects. In this rendition, law serves less as a restraining deterrent for or- 
ganizational malfeasance than as a supporting framework for all organizational 
activity. Consequently, organizations often come to embody the underlying 
logic of the regulatory legal environment even as they try to evade its most 
burdensome provisions; and, conversely, organizations often come to shape the 
law through their collective sense-making activities (Weick 1993) even when 
they fail to control it through their self-interested lobbying efforts. 

In discussing regulatory law as an independent variable, culturalist research 
draws heavily on more general “neoinstitutional” accounts of how organiza- 
tions respond to societal rule systems, broadly defined (e.g. Meyer & Rowan 
1977, DiMaggio & Powell 1983, Scott 1995). Legal compliance is, in essence, 
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simply a special case of “institutional isomorphism"—the process by which 
organizations facing the same cultural rules and understandings become both 
increasingly similar to one another and increasingly congruent with the myths 
of their shared environment. The central argument is that attributes of the insti- 
tutional environment—such as proximity to the public sphere, unionization, and 
the presence of various professional groups—explain organizational responses 
to law better than do traditional technical considerations (e.g. Edelman 1992, 
Sutton et al 1994, Konrad & Linnehan 1995). 

Within this general orientation, however, one can identify important diver- 
gences in how particular researchers characterize the mechanisms of legal influ- 
ence. Different accounts emphasize, to varying degrees, three distinct sources 
of organizational conformity with law: In coercive models (e.g. Fligstein 
1991), organizations conform because law commands them to do so and im- 
poses sanctions for noncompliance. This is essentially the same dynamic as 
that emphasized by materialist perspectives—although perhaps with an added 
skepticism about whether the underlying legal rules reflect material needs or 
cultural myths. In normative models (e.g. Edelman et al 1991, Edelman et al 
1992), organizations conform because law enunciates social values, ethics, and 
role expectations, which organizations (and their members) then elaborate and, 
to various extents, internalize. And in cognitive models (e.g. Dobbin et al 1993), 
organizations conform because law makes certain forms of action seem more 
natural, plausible, and fitting than others." 

Moreover, within each of these perspectives, accounts differ in the extent to 
which they depict organizational responses as deeply transformative versus su- 
perficially ceremonial. Às Edelman (1990, 1992) suggests, organizations may 
adopt outwardly compliant structures as a visible demonstration of attentive- 
ness to legal mandates, norms, and schemas—while at the same time preserving 
traditional managerial prerogatives by decoupling structural symbols from sub- 
stantive practices (for similar observations, see Oliver 1991, Kaplan & Harrison 
1993). While some studies portray such activities as cynical window-dressing, 
ceremonialism may also arise out of either hypocritical self-righteousness or 
agnostic acquiescence to established conventions. Regardless of the underly- 
ing motivation, the decoupling of ceremony from substance arguably undercuts 
and marginalizes the role of law in organizational life. Nonetheless, several 


Readers familiar with the sociology of organizations will recognize that this depiction paral- 
lels DiMaggio & Powell's (1983) influential trichotomy of institutional isomorphisms: “Coercive 
isomorphism,” occurs when organizations submit to the demands of powerful external actors, such 
as the regulatory agencies of the state; "normative isomorphism” occurs when organizations import 
the outlooks and ethics of professionals and other organized value-carriers; and “mimetic isomor- 
phism” occurs when organizations resolve cognitive uncertainty by copying the practices of other, 
similar organizations. 
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analysts have pointed out that, if the primary impact of law is a diffuse cultural 
one, widespread ceremonialism may not be equivalent to outright illegality, 
since the former re-presents and thereby reifies the law’s cultural claims in a 
way that the latter does not (see e.g. Friedland & Alford 1991:250, Edelman 
1992:1569, Suchman 1995b:588). 

Although culturalist writings widely acknowledge the theoretical distinction 
among these various sources of conformity/isomorphism, the literature increas- 
ingly suggests that multiple isomorphic pressures may operate together in any 
particular historical instance. While early neo-institutional writings generally 
paralleled the materialist literature in viewing the regulatory legal environment 
as primarily coercive (see, e.g., DiMaggio & Powell 1983, Scott 1991, Pligstein 
1991; for a more extensive review, see Suchman & Edelman 1996), recent stud- 
ies have been somewhat more eclectic. Even authors who focus on the impact of 
ostensibly coercive legal sanctions occasionally inject cognitive or normative 
elements into their accounts—for example by highlighting the ability of co- 
erced isomorphism to foster taken-for-grantedness (Tolbert & Zucker 1983),!? 
or by exploring how the (mis)perception of a coercive threat can affect the intra- 
organizational influence of certain professional groups (Edelman et al 1992). 
Other analyses have moved still farther away from a primarily coercive view, 
emphasizing that coercive, cognitive and normative sources of conformity often 
operate co-equally (e.g. Baron, Dobbin & Jennings 1986). 

Indeed, a number of recent studies of the spread of bureaucratized personnel 
practices go so far as to suggest that coercive conformity may, if anything, play 
only a subsidiary role. Edelman (1990) depicts the extension of due process 
protections to non-union employees primarily as a normative spillover from the 
civil rights movement, which legitimated employees’ claims to fair treatment. 
Since personnel practices changed even in firms that did not formally fall under 
equal employment opportunity (EEO) law, such legal enactments presumably 
served not so much to mandate the new organizational structures as to sym- 
bolize their sociocultural importance. Similarly, Edelman (1992) and Dobbin 
et al (1993) argue that the numerous ambiguities of US employment law of- 
ten render it toothless as a source of coercion, even for firms falling within 
its official purview. At the same time, however, this statutory indeterminacy 
(coupled with substantial rhetorical posturing) creates a climate of legal uncer- 
tainty, in which the cognitive dynamics of inter-organizational mimesis and the 
normative dynamics of professional proselytizing can flourish. In this view, 
coercive effects may actually follow normative and cognitive effects, rather 
than preceding them: Initially ambiguous laws acquire sufficient specificity for 


Interestingly, Zucker (1988.33) offers a dissenting voice on this point, arguing that overt 
coercion tends to undercut extenority and objectrvity, two key components of taken-for-grantedness 
(see also Suchman 1995b) 
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judicial enforcement only after professional and organizational communities 
have socially constructed a taken-for-granted definition of compliant behavior 
(Edelman et al 1996, Suchman & Cahill 1996, Suchman & Edelman 1996). 

One important implication of such noncoercive models is that the effects of 
legal change should not appear instantaneously with the enactment of new sanc- 
tions, but rather should build gradually over time, as normative and cognitive 
beliefs become increasingly institutionalized. Consistent with this prediction, 
a substantial body of empirical evidence suggests that organizations adopt the 
structural trappings of legal compliance at a rate that is directly proportion- 
ate to the number of prior adopters: Tolbert & Zucker (1983) report essentially 
this pattern in the diffusion of civil service reforms, and several researchers find 
analogous trends in the diffusion of workplace due-process protections (Dobbin 
et al 1988, Edelman 1990, Sutton et al 1994) and in the spread of organizational 
responses to EEO law (Edelman 1992, Dobbin et al 1993).!? Similarly, Mezias 
(1990) discusses the accelerated diffusion of a novel financial reporting strat- 
egy, once the new formulation had received the professional imprimatur of the 
Accounting Principles Board; and Suchman (1995a) describes the decade-long 
spread of standardized venture capital financing contracts from Silicon Valley 
to more socio-geographically remote locales. In each of these analyses, the 
authors attribute observed diffusion patterns primarily to the gradual institu- 
tionalization of supportive professional norms and to the mimetic routinization 
of plausible cognitive models.” 

As illustrated by the significance of proactive professional proselytizing in 
many institutionalist accounts, the characteristics of the regulatory legal en- 
vironment may be a consequence, as well as a cause, of extralegal cultural 
dynamics. Although to date the culturalist literature on law and organizations 
has devoted relatively little attention to the role of organizational beliefs in 
generating new statutory enactments (but see Dobbin 1994, Suchman & Cahill 
1996), it has built on a long history of law and society research that depicts 
extralegal organizations as “filtering agents” (Kidder 1983) between “law on 


I? Interestingly, Abzug & Mezaas (1993) suggest that institutonalization has followed a different, 
and much slower, path 1n the case of comparable worth policies, where the courts have generally 
rejected employee claims, than in other EEO contexts, where the courts have been more favorably 
disposed (but see McCann 1994). While this finding does not necessanly require a purely coercive 
explanation, it does suggest that extra-legal normative and cognitive dynamics are not entirely 
autonomous from the influence of formal legal institutions. 

It is perhaps worth noting that the stickiness of enforcement mechanisms might produce some 
lag ın legal umpact, even 1f only coercive dynamics were invo:ved Grven this, an S-shaped diffusion 
curve provides, &t best, ambiguous evidence for normative and mimetic hypotheses Nonetheless, 
the lersureliness of the adoption patterns reported in the hterature—coupled with the pervasive 
impact of professionalization—strongly suggests that coercive perspectives capture only part of 
the picture. 
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the books" and "law ın action.” Culturalist research has focused, particularly, 
on three such filters: professional groups, organizational bureaucracies, and 
inter-organizational communities. 

Beyond simply serving as "conduits" for the diffusion of pre-existing legal 
norms (DiMaggio & Powell 1983, Kagan & Rosen 1985), professional groups 
also extensively filter and transform those norms, in the process. Drawing on 
the pioneering work of Selznick (1949, 1957, 1969), as well as on influential 
comments by DiMaggio & Powell (1983), many culturalist accounts of law and 
organizations call attention to the important role of professional discourses in 
constructing the meaning of initially ambiguous laws, in determining the situ- 
ations to which legal reasoning applies, and more generally, in advocating for 
the legality and legitimacy of particular worldviews. Significantly, these “law- 
making" activities are undertaken not only by attorneys, but also by nonlegal 
professionals, such as personnel officers, doctors, teachers, and accountants. 

Thus, for example, Edelman and her colleagues (Edelman 1992, Edelman 
et al 1996, Edelman et al 1991) draw on Selznick's work to posit not only that 
professionals within organizations play an active role in promoting particular 
structural responses to law (here, EEO offices and bureaucratized personnel 
procedures), but also that they bring their own political and normative commit- 
ments to bear in defining the role of the resulting structures (particularly, the 
extent to which managers are challenged to make compliance substantive as 
well as symbolic). Similarly, in an analysis of neo-natal intensive care, Heimer 
(1993) shows that Jaw has a much greater impact when it conforms with the 
authority claims of key organizational professionals (in this case, doctors). Fur- 
ther, Heimer suggests that different groups of professionals may be responsive 
to different elements of the normative environment, and that these groups may 
enjoy substantial leeway to choose among competing institutional models em- 
bodied in divergent bodies of law (cf Friedland & Alford 1991). In this sense, 
legal regulation is often “enacted” (Weick 1979) at a fairly local level, with 
intraorganizational professional constituencies playing a significant part in de- 
termining which institutional norms and scripts get reflected in organizational 
structures and practices (Scheid-Cook 1992). 

Empirical evidence on the filtering role of the professions suggests that pro- 
fessional activities can either dampen or amplify the impact of law, depending 
on the circumstances. At times, professional staffs will reframe reform-minded 
legal ideals so as to minimize the law's impact on established bureaucratic rou- 
tines. Edelman et al (1993) report, for example, that organizational complaint- 
handlers tend to construe EEO law as merely requiring the same sorts of fair 
and consistent employment practices as are implied by preexisting norms of 
good management. In recasting discrimination complaints as personality con- 
flicts or issues of "management style," grievance officers enact a therapeutic 
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understanding of civil rights law that emphasizes the restoration of harmony 
to the workplace, rather than the vindication of formal legal rights. Discrimi- 
nation is thus depoliticized and individualized, and legal influences tend to be 
undermined. 

Conversely, at other times professionals can act as a kind of Greek chorus, am- 
plifying and reifying legal threats in ways that may have little to do with reality 
(Edelman et al 1993, Galanter 1983, Givelber et al 1984, Edelman et al 1996). 
Givelber et ai (1984), for example, discuss the media’s role in (mis)educating 
therapists about the Tarasoff decision, which held that psychotherapists owe a 
duty of care to third parties who may be in danger from the therapists' patients. 
Similarly, Edelman et al (1992) show that the personnel and legal professions 
greatly inflated the risk of wrongful discharge lawsuits, encouraging employers 
to revise employment policies to address a largely nonexistant legal hazard. 
Following Pfeffer (1981) and Larson (1977), these authors suggest that profes- 
sionals sought to increase their power and prestige within organizations—as 
well as the market demand for their services—by constructing the legal envi- 
ronment as a major threat and then claiming the unique expertise to craft an 
effective response. In such cases, managers' professionally filtered perceptions 
of the legal environment may be substantially more constraining than the law 
itself. 

Professions, of course, are not alone in their ability to filter (and thereby 
transform) the law. Individual organizations, too, participate in the social con- 
struction of the regulatory environment, primarily through their bureaucratic 
practices and internal legal cultures. Indeed, organizational constructions of 
law often provide the chief conduits between formal law and everyday legal 
consciousness. For most people, the legal system is both remote and arcane, 
and popular understandings of law and legality come largely from day-to-day 
experiences in concrete bureaucratic settings, not from exposure to abstract doc- 
trine (Macaulay 1987, Sarat 1990, Ewick & Silbey 1992, Fuller et al 1997). In 
these mundane organizational encounters, formal structures symbolize commit- 
ment to legal objectives, while informal norms give content to legal principles. 
Consequently, organizational responses to law play a key role in reifying legal 
mandates in daily life. 

Once again, such filtering agents can act either to amplify or to dampen 
law's impact. Selznick's (1969) account of the spread of legalism in the work- 
place illustrates the organizational reinforcement and extension of official legal 
doctrines, and this theme reappears in more recent work on "organizational cit- 
izenship" (e.g. Scott & Meyer 1991, Monahan et al 1994) and “psychological 
contracts" (Rousseau & Parks 1992) as well. Not all organizational responses 
to the legal environment involve rights-enhancing rules, however. Thus, for 
instance, Edelman et al (1992) and Sutton & Dobbin (1996) both describe how 
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employers have inserted disclaimers into personnel manuals and application 
forms, in an apparent attempt to preserve employment-at-will in the face of 
more pro-employee doctrinal developments. Interestingly, Sutton & Dobbin 
(1996) point out that organizational characteristics associated with the creation 
of rights-enhancing structures are also associated with the creation of rights- 
negating structures; both are responses to the institutional environment. 

Finally, law is often also mediated by entire communities of organizations 
acting in concert. At this level of analysis, the influence of extra-legal orga- 
nizational actors may go beyond merely filtering the interpretation and appli- 
cation of preexisting laws to include the active construction of new regulatory 
regimes. Along these lines, the cultural perspective offers an alternative view 
of what materialists see as the organizational “capture” of state enforcement 
agencies, stressing shared belief systems rather than quid-pro-quo power de- 
pendence. Where materialists emphasize the ability of regulated industries to 
“buy off” their putative overseers, a cultural analysis would suggest that legal 
institutions often adopt industry-friendly measures because both regulators and 
regulatees operate within the same institutional fields. Thus, continuous flows 
of information, participation by the same professional groups, extensive per- 
sonnel exchanges, and exposure to a common set of rational myths lead state 
agencies and the organizations under their jurisdiction to adopt a common ideo- 
logical outlook and mutually compatible practices (Edelman 1996, Suchman & 
Edelman 1996, Suchman & Cahill 1996). Further, even when agencies manage 
to avoid cultural co-optation, constrained budgets and weakly conceptualized 
statutory objectives often lead law enforcers to prefer culturally constructed 
symbols of compliance over less certain tests of material results (Hawkins 1984, 
Blumrosen 1993)—and regulated organizations are quite willing to collaborate 
in constructing symbolic criteria that meet the needs of regulators without fun- 
damentally disrupting the established routines of the targeted sector (Edelman 
1992, Suchman & Edelman 1996). 

As the preceding pages demonstrate, culturalist discussions of the regulatory 
legal environment routinely invoke a large number of explanations in which law 
is either explicitly the independent variable or explicitly the dependent variable. 
Nonetheless, the overarching assumption that legal meanings are socially con- 
structed, and that this construction process often takes place at the level of 
entire organizational fields, implies that the relationship between law and or- 
ganizations is not so much reciprocally causal as endogenously coevolutionary 
(Suchman 1993b). 

If all legal doctrine were substantive and unequivocal and if all legal im- 
plementation were coercive and undeviating, then one could imagine a simple 
feedback loop, in which organizations would adjust their behaviors exclusively 
in response to existing laws and would exercise their influence exclusively in 
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pursuit of future laws. In the culturalist account, however, the plot-line is much 
messier than this. Because the ambiguity of law-on-the-books is not an occa- 
sional aberration but rather a political fact of life (Baer et al 1988), the practical 
meaning of any given law-in-action can only emerge through a highly interac- 
tive process of social construction. Perhaps not surprisingly, this sense-making 
exercise is likely to involve not only the official agents of the legal system 
(regulators, judges, litigators, and the like), but also the members of the focal 
organizational field (including individual firms, professional groups, trade as- 
sociations, media observers, and legal advisors). Together—through dialogue, 
litigation, and mimesis—these entities collaborate to enact simultaneously the 
meaning of the law, the nature of the enforcement threat, and the options for 
compliance. Eventually these efforts may produce a working agreement on 
what the law “is” and what it “requires,” and that agreement may gradually 
become reified and institutionalized in formal structures and rational myths; 
nonetheless, to say that the legal rules “cause” the organizational practices (or 
vice versa) is, at best, a gross simplification. The two emerge in tandem, and 
as the underlying belief system permeates both the legal and the organizational 
worlds, the boundaries between these realms become increasingly ambiguous. 

Applied to the regulatory legal environment, this endogenous imagery sug- 
gests that justifications for social control, professional commitments, organiza- 
tional structures, state enforcement mechanisms, and judicial reasoning all flow 
from interlinked contemporaneous discourses. To explicate the endogeneity of 
law, Edelman (1996) suggests that the organizations of the legal system, like 
other organizations, operate within organizational fields. The socio-legal field 
overlaps with the fields of regulated organizations, so that ideas of rationality, 
morality, and legality flow freely between the legal system and extra-legal orga- 
nizational realms. Practices in one field help to constitute ideas of legitimacy in 
the other, and the dynamic interplay between these fields gives rise to socially 
constructed legalities in both arenas. Thus, for example, Edelman et al (1996) il- 
lustrate how the rationality of grievance procedures is jointly constructed across 
organizational and legal realms, and in particular, how courts incorporate emer- 
gent organizational practices into the legal definition of reasonable compliance 
with civil rights law?! 


7lyt 18 perhaps worth noting that this image of unimpeded communication between legal and 
organizational fields contrasts sharply with European "autopoietic" theories of law, which stress 
the incomensurability of legal and economic discourses (see, e g , Luhmann 1985, Teubner 1988) 
In the autopotet.c view, law (like other discursive realms) 1s a self-referential system, ın which 
external phenomena appear only as legal representations, constructed in terms of legal orientations 
and legal imperatives Efforts to communicate across the boundaries of such systems are at best 
hazardous, and at worst umpossible While autopoiesis theory serves as a useful reminder that 
similar signifiers may have quite different meanings and consequences 1n different social spheres, 
it has yet to exert a substantial umpact on empirical mvestigations into the relationship between law 
and organizations—at least within the intellectual traditions reviewed here. 
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Cultural Perspectives on the Constitutive Environment 


Like materialists, culturalists see constitutive law as providing the basic defini- 
tional building blocks of organizational life. However, where materialists see 
these building blocks as simply being default starting points for rational engi- 
neering and negotiation, culturalists see them as being largely pre-conscious 
frameworks for making sense of the social world.” In this view, law plays 
a central role in the “social construction of reality” (Berger & Luckmann 
1967), with legal definitions often becoming reified and institutionalized as 
taken-for-granted components of “the way the world works.” The central task, 
then, becomes understanding how these consciousness-framing legal schemas 
construct—and are constructed by—organizational activity.” 

As an independent cultural variable, constitutive law establishes a structure 
of categories and definitions for understanding social relations—and provides 
a set of accepted routines for manipulating that structure. Law generates under- 
standings, for example, of what is and is not a corporation and who is or is not 
an employee; moreover, law offers taken-for-granted scripts for forming corpo- 
rations and for hiring and firing employees (Suchman & Edelman 1996). Such 
constitutive dynamics can be seen in several central aspects of organizational 
life, including inter-organizational relations, organizational structures, and the 
cultural mindset of "legal-rationality" (Weber 1947). 

Law constructs inter-organizational relations in part by providing a “cultural 
toolkit" (Swidler 1976) that organizations draw on when they interact with one 
another. This is most obviously true of contract law, which delineates appropri- 
ate symbols and rituals for forming binding commitments (Suchman 1994a; cf 
Macaulay 1963, Gabel & Feinman 1982). However, it applies to other branches 
of law, as well: Property law outlines means by which organizations claim con- 
trol over resources and ideas (Campbell & Lindberg 1990); tort law helps to 
construct the scope of organizations’ spontaneous responsibilities to their so- 
cial surroundings; and bankruptcy law defines the priority of organizations’ 
competing obligations to their various stakeholders (Delaney 1989). 


22Rational decision-making holds, at best, a subordinate position in most culturalist accounts 
of constitutive law. While self-interested actors may tinker instrumentally at the margins, funda- 
mental legal forms go mostly unquestioned After all, if law shapes our understanding of reality, 
organizations have no truly extra-legal vantage point from which to strategize; indeed, as “artificial 
persons" even their capacity to engage 1n instrumental actrvity is, itself, largely a legal construct. 

23 It is perhaps worth re-emphasizing that we use the term “constitutive legal environment" to refer 
to law’s impact ın constituting organizations, rather than to organizations’ impact in constituting 
law. As we argue in the teat, organizations and professions do play a constitutive role in the social 
construction of law; however, such effects are at least as likely to arise in the contexts that we, 
here, label “regulatory” as in those that we label “constitutive.” Indeed, one might argue that 
the taken-for-grantedness of constitutive legal definitions actually renders them less susceptible to 
“bottom-up” influences than are more openly contested regulatory legal mandates. 
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Beyond providing a “vocabulary of significant gestures” (Mead 1934) that 
pre-existing organizations employ in their interactions, law also helps to con- 
struct organizational structure, itself. In a limited way, this occurs whenever 
law defines the characteristics of a particular organizational feature, such as an 
affirmative action policy (Edelman 1990), or a “poison pill" takeover defense 
(Powell 1993, Davis 1991). More broadly, law also codifies ground-rules for 
entire organizational forms, suggesting alternative models for collective action 
and imbuing each with certain chartered attributes and capacities (cf Meyer, 
Boli & Thomas 1987, Krasner 1988). To paraphrase Scott (1993), organiza- 
tions are not “real” primordial creatures, but are social constructions, defined 
and given meaning in large part by legal institutions. Historical and compar- 
ative work, for example, reveals the central role of law in constructing the 
modern limited-liability corporation (Coleman 1974, 1990, Seavoy 1982, Roy 
1990, Creighton 1990, Klein & Majewski 1992) and in shaping the boundaries 
between private firms, public agencies, collective enterprises, and nonprofit 
organizations (Nee 1992, Hansman 1996, Campbell & Lindberg 1990). More- 
over, the available evidence suggests that such issues of legal form are often 
significant determinants of organizational behavior and performance (Haveman 
1993, Rao & Neilsen 1992, but cf Macaulay 1963). 

Finally, the legal system sometimes constructs organizational practices less 
directly, simply by contributing to the elaboration of an underlying cultural 
logic of "legal-rationality" As Weber (1947) observed, modern organizations 
and modern law both embrace a legal-rational orientation, and consequently 
the two realms share a great deal of their justificatory rhetoric. Although it 
would be wrong to see legal-rationality as entirely a product of the formal law, 
it is clear that legal definitions often generate sympathetic vibrations in other 
legal-rational social structures, including organizations (cf Sitkin & Bies 1993). 
Thus, organizational embodiments of "fairness" (Edelman 1990); “citizenship” 
(Selznick 1969, Monahan et al 1994); “negligence” (Ellickson 1986, 1990), and 
the like tend to mimic their legal counterparts.”4 

To the extent that laws construct organizations rather than merely regulating 
them, the ability of organizations to exert a truly exogenous impact on law 
would seem to be quite limited—particularly when legal definitions are stable, 
uniform, and fully institutionalized. Legal definitions, however, are rarely fully 


25 Barly hints of this analogical outlook can be found in Selznick's (1969) foundational work 
on the evolution of legality. Although organizations occasionally adopt procedural safegaurds 
1n response to regulatory mandates, on the whole these procedures spread simply because they 
become the definition of fair treatment. In this sense, by creating and publicizing forms of legal 
procedure, the law establishes models that constitute the meaning of legality even ın extra-legal 
settings Further, the reconstruction of these procedural elements within organizations more or less 
parallels the bottom-up reconstruction of other legal devices within organizational communities, 
as described below. 


LEGAL ENVIRONMENTS OF ORGANIZATIONS 505 


institutionalized, and this opens the door for a number of investigations into 
constitutive law as a dependent variable. 

First, to the extent that organizations enjoy sufficient cognitive latitude to cre- 
ate genuine innovations, these novel devices may eventually serve as the basis 
for new legal categories and definitions (e.g. Powell 1993, Edelman 1992, 1996, 
Suchman 1994a, 1995a, Suchman & Cahill 1996). Second, even without engag- 
ing in intentional innovation, organizations may inadvertently destabilize the 
constitutive legal environment by implementing established forms imperfectly 
(see e.g. Westney 1987, Lewis & Siebold 1993) or by elaborating established 
forms unsuitably (see, e.g., McBarnett 1992, Delaney 1989, Barney, Edwards 
& Ringleb 1992). In this way, organizations often generate legal pluralism 
(Merry 1988) and legal contradiction (Tushnet 1984) as an unintended conse- 
quence of everyday activity. Third, given a modicum of pre-existing pluralism 
and contradiction, organizations may skew the development of constitutive law 
by favoring some legal alternatives over others: Assuming that legal definitions 
operate like other institutional models, specific formulations will tend to gain 
cognitive legitimacy as the number of visible organizational examplars rises (cf 
Tolbert & Zucker 1983, Hannan & Carroll 1992, Aldrich & Fiol 1994, Edelman 
1990, Suchman 1995b). 

Finally, and perhaps most importantly, organizations (and organized profes- 
sions) participate actively in the social construction processes that give new 
laws their meanings. This applies not only to the regulatory laws described 
in the previous section, but to constitutive laws as well. Initially, new statu- 
tory definitions often provide an abstract vocabulary without concrete referents. 
When called upon to apply this vocabulary, judges look to the extra-legal world 
for “data points" that might give substance to the law’s abstract labels. In sev- 
eral fields, for example, courts openly employ "trade practices" and "industry 
standards" as key interpretive guideposts (Suchman & Cahill 1996). The same 
inductive process occurs within organizations, too, as employees and managers 
come to understand the categories of the legal system in terms of the examples 
that they see in their own workplaces (Scheid-Cook 1992, Fuller et al 1997). 
Consequently, the prevailing routines of organizational communities frequently 
define the content of legal categories. 

Suchman's work on the role of lawyers in Silicon Valley (Suchman 1994a,b, 
1995a, Suchman & Cahill 1996) illustrates a number of these dynamics. Be- 
cause the formation of high-technology start-up companies was only weakly 
institutionalized within the larger socio-legal environment, Silicon Valley's le- 
gal, financial, and industrial sectors enjoyed substantial leeway to experiment 
during the community's early years. Such freedom, however, came at the price 
of profound cognitive uncertainty, and from 1975 to 1990, the region's lead- 
ing law firms and venture capital funds worked to moderate this uncertainty 
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by fostering community-level normative and cognitive structures, and by in- 
troducing Silicon Valley customs into the rules of the larger legal system. In 
particular, due to their distinctive structural position, local law firms were able to 
play a central role as inter-organizational pollinators, compiling case-by-case 
innovations into standardized constitutive formulae. Over time, the result- 
ing "Silicon Valley model" of routinized venture capital financing gradually 
achieved codification both in legal form-books and in national securities law, ul- 
timately becoming a virtually taken-for-granted element ofthe constitutive legal 
environment.” 

Beyond the emergence and institutionalization of particular legal devices, the 
systemic impact of organizations on constitutive law is perhaps clearest in 
the case of the legal-rational “sympathetic vibrations” described above. While 
the legal system enjoys cultural primacy with regard to the “legal” components 
of legal-rationality, the organizational world often enjoys a similar position of 
leadership with regard to the “rational” components. Bureaucratic notions 
of planning, efficiency, and economic necessity exert a substantial impact on 
judicial doctrines regarding reasonable responses to law as well as on legislative 
understandings of wise policy-making. Moreover, just as organizations mimic 
legal models of justice, legal institutions often mimic organizational models 
of efficiency (Heydebrand & Seron 1990, Savelsberg 1992). The debate over 
the legalization of organizational life has its parallel in the debate over the 
bureaucratization of legal life. 

In reality, of course, both legal and organizational actors collaborate in 
the elaboration of a single, endogenous regime that encompasses (and tran- 
scends) both sectors. Endogeneity may be a bit harder to see in the case of the 
constitutive legal environment—which often operates as a set of background 
assumptions—than in the case of the more openly contested regulatory envi- 
ronment; nonetheless, the juxtaposition of legallv constituted organizations and 
organizationally embodied laws creates a substantial potential for coevolution 
between the two. Even as organizations are passively constructed by highly 
institutionalized elements of the constitutive legal environment, those same or- 
ganizations may actively participate in the discursive elaboration of other, less 
well-institutionalized elements. In this regard, organizations are no different 
from other social actors, whose ability to change their culture is predicated on 
their constitution and empowerment by that culture. 


25Suchman’s findings can be contrasted, to some extent, with Macaulay's (1963) classic study 
of "non-contractual relations" 1n business. In the terms of the present discussion, Macaulay’s 
research highlights the ability of local community norms to buffer organizational interactions from 
the impact of legal categories and definitions—a finding replicated in a number of other studies in 
the Jaw and society tradition as well (e g. Ross 1980, Ellickson 1986). 
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Toward an Integrative Model 


The sociology of law and organizations has experienced something of a renais- 
sance recently as it has become clear that organizations interact with their legal 
environments in multiple contexts. Law and, more broadly, legal environments 
create, constrain, shape, enable, define, and empower organizations. Law and 
legal environments, moreover, provide forums and templates for interactions 
among organizations. At the same time, organizations, for their part, provide 
an important terrain for the interpretation of legal rules and for the structuration 
of legal environments. 

We have identified three facets of organizations’ legal environments—facili- 
tative, regulatory, and constitutive—and we have argued that on each of these 
dimensions, causation may flow in both directions (albeit to varying extents). 
Facilitative environments provide the tools for legal interaction and dispute res- 
olution but those tools are constantly transformed in the organizational context. 
Regulatory environments place law in the posture of seeking to control organi- 
zations, but much regulation grows out of organizations' actions and agendas, 
and organizational responses to regulation often define the meaning of compli- 
ance. Constitutive environments come into play as legal forms, and categories 
help to define the very nature of organizations and their environments; but the 
rules, principles, and values that comprise legal forms and categories often arise 
out of organizational practices and norms. 

We have also pointed to two metatheoretical perspectives on organizations 
and law: the materialist approach, which views law as a set of economic in- 
centives and organizations as rational-actors; and the cultural approach, which 
sees both law and organizations as socially constructed phenomena. Litera- 
ture in the materialist camp emphasizes the facilitative and regulatory facets of 
the legal environment, whereas literature in the cultural camp emphasizes the 
regulatory and constitutive. 

In short, we have constructed a typology of three facets of the legal environ- 
ments of organizations, crossed by two metatheoretical perspectives. Future 
work on law and organizations should expand this typology from a static set 
of categories into a comprehensive understanding of the dynamics linking the 
three facets of the legal environment, and of the interplay between materialist 
and culturalist concerns. In our view, such a dynamic model should pursue 
three objectives. 

First, scholars should study interactions among all three types of legal envi- 
ronments. Regulatory law, for example, may often achieve its effects through 
the facilitative environment, as when organizations accuse each other of anti- 
trust violations, civil rights violations, labor law violations, or false advertising. 
The constitutive environment, for its part, defines the legally cognizable actors 
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and actions that can enter into the arenas of facilitative law. And outcomes of 
facilitative law often need to be enforced by regulatory agents of the state. Per- 
haps most strikingly, essentially regulatory statutes often generate substantial 
constitutive spillovers, as the social construction of compliance yields new legal 
and bureaucratic devices (such as bureaucratized personnel reviews or financial 
“due diligence” investigations), which then become reified as taken-for-granted 
building blocks of organizational structure. 

Second, scholars in the materialist and cultural camps, respectively, must 
each address the findings and concerns of the other. In particular, research should 
consider the relative import of rationality on the one hand, and normative and 
cognitive beliefs on the other—and should devote special attention to the social 
conditions or locations that lead each mode of explanation to become most 
valid. For example, under what conditions do organizations behave rationally 
in response to law, and under what conditions do ideas about rationality, itself, 
change in response to legal and social norms? To what extent do legislatures, 
courts, and administrative agencies incorporate organizational constructions of 
rationality, and vice versa? To a large degree, of course, the debate between 
material and cultural perspectives reflects an analogous debate in organization 
theory generally, and the same questions ought to be asked in other contexts. 
But it may be that legal environments provide an especially interesting context 
in which to examine these issues, because laws are intended both to set forth 
general normative principles and to impose costs and benefits for particular 
organizational behaviors. 

Third, in order to address the connections between the three facets of the 
legal environment and to mediate between the materialist and cultural perspec- 
tives, researchers should develop more specific models of the endogeneity that 
exists between law and organizations. With luck, in explaining the processes 
by which organizations incorporate legal ideals and the processes by which 
the law incorporates organizational models, we will better discern the relation- 
ship among rational, normative and cognitive factors or better understand the 
interaction between facilitative, regulatory, and constitutive environments. 
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